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EXPRESSION SYSTEM FOR SEED PROTEINS 

Field of the Invention 

The present invention relates to the production of heterologous polypeptides in 
5 the grains of transgenic plants, methods, vectors and transformed hosts for producing 
the same, and compositions comprising such polypeptides and nucleic acids. 
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Background Of The Invention 

10 Recombinant proteins have been expressed in vitro in different host expression 

systems such as bacterial cells, yeast and other fungi, mammalian cells, insect cells 
and, to a certain extent, plants. Each host expression system has its associated 
advantages and disadvantages. 

Plants are attractive as hosts for expression of recombinant proteins as they are 

1 5 free from animal viruses and from toxins that are sometimes associated with microbial 
hosts. Scale-up can be performed much more easily simply by planting more acreages. 
Further, to the extent that the plant system is edible, recombinant molecules expressed 
in plant hosts may not require substantial purification if the recombinant molecules can 
retain bioactivity upon being ingested. However, up to the present, the level of 

20 expression heterologous proteins in transgenic plants has been low and purification of 
recombinant proteins from portions of the plant, such as leaves, etc., can be costly, 
making such an expression system commercially impractical. 

There is, thus, a need for a reliable method and system for effecting high level 
expression of recombinant or heterologous polypeptides in plants. Such a system may 

25 require a unique or novel combination of components parts such as one or more of: 
promoter, enhancer, transcription factor, codon-optimized heterologous gene, 
terminator, leader sequences, selectable marker, etc., that can operate efficiently 
together. 

Wu, C.Y. et a/., Plant Cell Physiol 39(8):885-889, 1998 compares the activity of 
30 promoters from rice seed storage proteins using the S'-fianking regions of two glutelin 
genes, GluA-3 and G/uB-l; a rice prolamin gene, NPR33; a rice a-globulin gene, Glb\ 
and a rice allergenic protein gene, RAG-1; fused transcriptionally or translationally to 0- 
glucuronidase ("GUS") reporter gene followed by a Nos terminator. Wu et al. concluded 
that, in their hands, "[o]n the basis of averaged values, the 13 kDa prolamin NPR33 and 
35 the a-globulin Gib promoters directed the highest GUS gene expression, whereas the 
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allergenic protein albumin RAG-1 exhibited the lowest promoter activity. The glutelin 
GluA-3 promoter displayed an intermediate level activity." (Page 886) 

international application WO 99/16890 discloses experiments relating to, among 
other things, the transformation of barley with nucleic acid constructs carrying maize 
5 embryo-specific globulin Glb1 promoter, maize ubiquitin Ubi1 promoter, barley 

endosperm-specific hordein promoter or the barley endosperm-specific D-hordein 
promoter, with the reporter uidA gene (encoding GUS activity). The application also 
discloses the use of a hordein signal sequence in conjunction with the B n hordein 
promoter for driving the expression of the uidA gene and concluded that, in their hands, 
10 the use of the Bi hordein signal sequence enhanced expression of the uidA gene under 
the control of the Bi hordein promoter in barley. 

There is a continue need in the art for expression systems including methods, 
vectors, hosts and other compositions, that are capable of effecting higher level 
expression of functional recombinant polypeptides in plants. 

15 

Summary Of The Invention 

It is one of the objects of the present invention to address the unmet need for 
reliable methods for high level expression of heterologous polypeptides in plants. 

It is another one of the objects of the present invention to provide vectors, hosts, 
20 and methods for such expression and compositions containing such. 

In one aspect, the invention includes method of producing a heterologous 
polypeptide in a grain (seed) of a plant. The method includes culturing a transformed 
plant to form a grain-producing transformed plant; and recovering transgenic grains 
containing the heterologous polypeptide from the grain-producing transformed plant. 
25 The transformed plant contains a chimeric gene having (i) a rice glutelin Gt1 promoter or 
a rice globulin Gib promoter, (ii) a nucleic acid leader encoding a rice glutelin Gt1 signal 
peptide; and (Hi) a heterologous nucleic acid that encodes the heterologous polypeptide, 
all operably linked to enable expression of the heterologous polypeptide. 

The plants are obtained by transforming plant cells; culturing the transformed 
30 plant cells to form a plurality of grain-producing transformed plants; and selecting a 

transformed plant that exhibits high level expression of the heterologous polypeptide in 
its grains. 

The transformed plant may further include a transcription factor-encoding nucleic 
acid that encodes one or more heterologous transcription factor that is capable of 
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enhancing expression of the heterologous polypeptide. Exemplary transcription factors 
include 02 or PBF, Reb, or active fragment thereof. 

The plant may be a dicot or monocot, and is preferably a monocot such as rice, 
wheat or barley. The heterologous protein produced in monocots is preferably a non- 
5 plant storage polypeptide, such as an antibody, cytokine, lymphokine, chemokine, 
polypeptide hormones growth factor, coagulation factor, anti-infective, or cytotoxin, a 
non-human animal polypeptide, a milk protein, an anti-inflammatory protein, an intestinal 
trefoil factor (ITF) or an active fragment thereof, and more particularly, lactoferrin, 
lysozyme, lactoferricin, intestinal trefoil factor (ITF), epidermal growth factor (EGF), 
10 keratinocyte growth factor (KG F), insulin-like growth factor I (IGF-I), lactohedrin, kappa- 
casein, haptocorrin, lactoperoxidase, alpha-1 -antitrypsin, immunoglobulins, alpha- 
lactalbumin, beta-lactoglobuiin, alpha-casein, beta-casein, albumin, fibrinogen, or a 
protease inhibitor. 

Exemplary chimeric genes include (a)a chimeric gene comprising a rice glutelin 

15 Gt1 promoter, a rice glutelin Gt1 signal peptide, and a codon optimized lysozyme coding 
sequence; (b) a chimeric gene comprising a rice glutelin Gt1 promoter, a rice glutelin 
Gt1 signal peptide, and a codon optimized lactoferrin coding sequence; (c)a chimeric 
gene comprising a rice glutelin Gt1 promoter, a rice glutelin Gt1 signal peptide, and a 
codon optimized EGF; (d) a chimeric gene comprising a rice glutelin Gt1 promoter, a 

20 rice glutelin Gt1 signal peptide, and a codon optimized IGF-I; (e) a chimeric gene 

comprising a rice glutelin Gt1 promoter, a rice glutelin Gt1 signal peptide, and a codon 
optimized haptocorrin; (f) a chimeric gene comprising a rice glutelin Gt1 promoter, a rice 
glutelin Gt1 signal peptide, and codon optimized IGF-I; (g) a chimeric gene comprising a 
rice glutelin Gt1 promoter, a rice glutelin Gt1 signal peptide, and a codon optimized 

25 lactahedrin;(h)a chimeric gene comprising a rice glutelin Gt1 promoter, a rice glutelin 
Gt1 signal peptide, and a codon optimized lactoperoxidase; and (i) a chimeric gene 
comprising a rice glutelin Gt1 promoter, a rice glutelin Gt1 signal peptide, and a codon 
optimized ai-antitrypsin. 

An exemplary chimeric gene is composed of (a) a rice globulin Gib promoter, a 

30 rice glutelin Gt1 signal peptide, and codon optimized haptocorrin; or (b) a rice globulin 
Gib promoter, a rice glutelin Gt1 signal peptide, and codon optimized lysozyme. 

The transformed plant may include a second chimeric gene, which includes (i) 
an aleurone-specific promoter; and (ii) the heterologous nucleic acid encoding the 
heterologous polypeptide, operably linked together to enable expression of the 

35 heterologous polypeptide. 
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In another aspect, the invention includes an expression vector that contains the 
above chimeric gene, and preferably, also a selectable marker gene. 

in still another aspect, the invention includes a transformed host containing this 
expression vector, the host being a plant cell, a plant or a grain. The transformed host 
5 may also be transformed with one or more coding sequences encoding one or more 
heterologous transcription factors capable of enhancing expression of the heterologous 
polypeptide in the transformed host.. The plant may be a monocot or a dicot, preferably 
a monocot such as rice, barley, and wheat 

In another aspect, the invention includes transformed grains containing the 
1 0 chimeric gene described above. 

In still another aspect, the invention includes a codon optimized nucleic acid 
molecule for expression of the polypeptides: (i)a h LF nucleic acid encoding human 
lactoferrin; as exemplified by SEQ ID NO: 3; (ii) a hLZ nucleic acid encoding human 
lysozyme; as exemplified by SEQ ID NO: 1 , (iti) a hEGF nucleic acid encoding human 
15 epidermal growth factor; as exemplified by SEQ ID NO: 8; and (iv) hIGF-l nucleic acid 
encoding human insulin-like growth factqr t as exemplified by SEQ ID NO: 9. 

Also disclosed is a method for producing a heterologous polypeptide in a grain of 
a plant, comprising the steps of culturing a transformed plant to form a grain-producing 
transformed plant; and recovering grains containing the heterologous polypeptide from 
20 the grain-producing transformed plant. The transformed plant has a first chimeric gene 
and at least one heterologous transcription factor that is capable of enhancing the 
expression of the first chimeric gene. The chimeric gene comprises: (i) a promoter of a 
storage protein gene; (ii) a nucleic acid leader encoding a signal peptide of a storage 
protein; and (iii) a heterologous nucleic acid that encodes the heterologous polypeptide, 
25 all operably linked to enable expression of the heterologous polypeptide. The 

transformed plant may further comprises a second chimeric gene having (i) an aleurone 
specific promoter; and (ii) the heterologous nucleic acid, all operably linked to enable 
expression of the heterologous polypeptide. 

An exemplary storage protein gene may be selected from the group consisting 
30 of: rice glutelins, rice oryzins, rice prolamines, barley hordeins, wheat gliadins, wheat 
glutenins, maize zeins, maize glutelins, oat glutelins, sorghum kafirins, millet 
pennisetins, and rye secalins. Preferred promoters are a rice Gt1 promoter, a rice Gib 
promoter or a wheat glutenin promoter. An exemplary signal peptide is also derived 
from the same gene, preferably a rice Gt1 signal peptide or rice Gib signal peptide. An 
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exemplary heterologous transcription factor may be selected from the group consisting 
of Reb, 02, and PBF. 

An exemplary aleurone specific promoter is the promoter of the lipid transfer 
protein gene Ltp1 or a promoter of chitinase gene Chi26. 
5 The transformed plant may be a dicot or monocot, preferably a cereal monocot 

such as rice, wheat, or barley. 

As above, the heterologous protein produced in monocots is preferably a non- 
plant storage polypeptide, such as an antibody, cytokine, lymphokine, chemokine, 
polypeptide hormones growth factor, coagulation factor, anti-infective, or cytotoxin, a 

10 non-human animal polypeptide, a milk protein, an anti-inflammatory protein, an intestinal 
trefoil factor (ITF) or an active fragment thereof, and more particularly, lactoferrin, 
lysozyme, lactoferricin, intestinal trefoil factor (ITF), epidermal growth factor (EGF), 
keratinocyte growth factor (KGF), insulin-like growth factor I (IGF-I), lactohedrin, kappa- 
casein, haptocorrin, lactoperoxidase, alpha-1 -antitrypsin, immunoglobulins, alpha- 

15 lactalbumin, beta-lactoglobulin, alpha-casein, beta-casein, albumin, fibrinogen, or a 
protease inhibitor. 

In another aspect, the invention includes a first chimeric gene that comprises (i) 
an aleurone-specific promoter and (ii) a heterologous nucleic acid encoding a 
heterologous polypeptide, operably linked to enable expression of the heterologous 

20 polypeptide. The vector may further include a transcription factor-encoding nucleic acid 
that encodes at least one heterologous transcription factor that is capable of enhancing 
expression of the heterologous polypeptide. The vector may further include (i) a 
promoter of a storage protein gene; (ii) a nucleic acid leader encoding a signal peptide 
of a storage protein; and (iii) the heterologous nucleic acid encoding the heterologous 

25 polypeptide, all operably linked to enable expression of the heterologous polypeptide. 

Also disclosed is a transformed host comprising a first chimeric gene comprising 
an aleurone-specific promoter operably linked to a first heterologous polypeptide to 
enable expression of the heterologous polypeptide. The host may further have at least 
one heterologous transcription factor, and a second chimeric gene comprising (i) a 

30 promoter of a storage protein gene, (ii) a nucleic acid leader encoding a signal peptide 
of a storage protein, and (iii) a heterologous polypeptide, all operably linked to enable 
expression of the heterologous polypeptide. Also disclosed is a heterologous 
polypeptide in the above seed . 

A method of processing a transgenic seed that comprises a heterologous 

35 polypeptide, in accordance with another aspect of the invention, includes the steps of 
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(a) powderizing the transgenic seed in a buffer to form bufferized powdered seed; 

(b) removing ceil debris from the bufferized powdered seed to obtain a supernatant; and 
6 (c)reducing liquid from the supernatant to produce an extract that comprises the 

heterologous polypeptide. The buffer may be a volatile buffer such as ammonium 
5 bicarbonate buffer, ammonium acetate buffer 

Another method of processing seeds, in accordance with the invention, includes 
the steps of (a) providing transgenic grains that contain starch and heterologous 
polypeptides; (b)providing an enzymatic composition for conversion of malt to wort; 

(c) combining the transgenic grains with the enzymatic composition under conditions 
10 that allow for conversion of starch in the transgenic grains to sugars; (d)allowing the 

starch in the transgenic grains to be at least partially converted; and (e)separating 
resulting transgenic malt syrup from resulting transgenic grain residue, each containing 
the heterologous polypeptide. The enzymatic composition may include amylases, e.g., 
from a liquid malt extract comprising barley. 

15 In a related aspect, the invention includes a method of processing transgenic 

grains containing a heterologous polypeptide to produce a malt extract. The method 
includes the steps of (a) providing transgenic grains that contain starch and 
heterologous polypeptides; (b) allowing the transgenic grains to undergo a malting 
process to form malted transgenic grains; and (c)crushing the malted transgenic grains. 

20 The method may further include mixing the crushed and malted transgenic 

grains with water to obtain a liquid transgenic malt extract. Preferred seeds are cereal- 
grain seeds, such as corn, sorghum, rice, barley, rye, wheat, oats and triticale. 
In accordance with another aspect of the invention, a transgenic malt syrup or extract 
comprising a heterologous polypeptide produced in transgenic grains in a malt syrup. 

25 Also disclosed is an edible food comprising a heterologous polypeptide in a 

transgenic malt syrup, a transgenic malt extract, or a transgenic grain residue in a food 
product. 

A protein-containing product of the invention includes a monocot seed flour, 
extract or malt composition containing and one or more seed-produced, non-seed 

30 proteins in substantially unpurified form, and a vehicle containing the composition in a 
form suitable for human or animal use. The vehicle is selected from the group 
consisting of a capsule, binder components effective to tabletize the composition, a 
consumable liquid, and a consumable suspension. The vehicle may be a processed 
food in which the composition is mixed. 

35 Exemplary seed-produced, non-seed proteins include: 
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(i) lysozyme and lactoferrin, where the vehicle allows for ingestion; 

(ii) a blood-clotting factor, where the vehicle is a surgical dressing, powder or 

cream. 

(iii) an industrial enzyme, where the extract is in a liquid or powder form, and the 
5 vehicle is a liquid or powder dispenser, respectively. 

(iv) a detergent enzyme, where the vehicle is a detergent, 

(v) an anti-infective, and the vehicle is selected from the group consisting of: 
toothpaste, mouthwash, hand soap, dish soap, dishwasher detergent, tile cleaner, 
bathroom cleaner, toilet cleaner. 

1 0 These and other objects and features of the invention will become more fully 

apparent when the following detailed description of the invention is read in conjunction 
with the accompanying figures. 

Brief Description of the Figures 

1 5 FIG. 1 is a map of the pAPM 59 expression construct that contains the human 

lysozyme coding sequence under the control of a Gt1 promoter and Gt1 signal 
sequence. 

FIG. 2 shows the results of Western blot analysis for the expression of 
recombinant human lysozyme in various tissues of rice plants, where lanes 1 and 15 are 

20 a human milk lysozyme standard; lane 2 is a broad range molecular weight marker from 
Sigma; lanes 3 and 4 represent mature seed tissue extracts; lanes 5 and 6 represent 
germinated seed extracts; lanes 7 and 8 represent root tissue extracts; lanes 9 and 10 
represent extracts from young root tissue; lanes 11 and 12 represent leaf extracts; and 
lanes 13 and 14 represent extracts from young leaf; from untransformed ("U") or 

25 transgenic ("T") plants, respectively. The total loading protein amount was 40 \jg per 
lane. 

FIG. 3 shows the effect of incubating recombinant human lysozyme from 
transgenic rice seed, a human lysozyme standard (30 pg/nil), a control (20 mM sodium 
phosphate, pH 7.0, 5 mM EDTA) or an untransformed rice extract on the growth of E. 
30 coli strain JM109. At the end of the incubation (for the time indicated), an aliquot of the 
mixture was plated on LB plates and colony forming units per ml (CFU/ml) was 
calculated. 

FIG. 4 is a graph showing the specific activity of lysozyme, as determined by 
incubating an identical concentration of a human lysozyme standard, human lysozyme 
35 from transgenic rice (plant) and lysozyme from chicken egg white with a standard 
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amount of M. luteus, followed by evaluation of the reduction in the turbidity due to the 
activity of lysozyme over five minutes. 

FIG. 5A show thermal stability of human lysozyme ("Hlys") and recombinant 
human lysozyme from transgenic rice ("rHLys"). Lysozyme was dissolved at 100 \jg/m\ 
5 in PBS. The mixtures were subjected to different temperatures for different lengths of 
time. At the end of each heat treatment, the remaining lysozyme activity was assessed 
by activity assay. 

FIG. 5B shows pH stability of Hlys and rHIys. Lysozyme was dissolved in 
different buffers at 100 \ig/m\. The mixture was incubated at 37°C for 30 min. The 
10 lysozyme activity was determined by activity assay. 

FIG. 6 presents the results of an analysis of lysozyme expression in transgenic 
rice grains over several generations. Proteins from 1 g of brown rice flour were extracted 
with 40 ml of extraction buffer containing 0.35 M NaCI in PBS. Extraction was conducted 
at room temperature for 1 h with shaking. Homogenate was centrifuged at 14,000 rpm 
15 for 15 min at 4°C. Protein supernatant was removed and diluted as needed for 

lysozyme turbidimetric activity assay. Extraction was repeated three times and standard 
deviation was shown as an error bar. Lysozyme yield was expressed as percentage of 
total soluble protein (%TSP). 

FIG. 7 is a restriction map of the pAP.11.64 plasmid that contains the human 
20 lactoferrin coding sequence under the control of a rice glutelin (Gt1) promoter, aGtl 
signal peptide, and a nopaline synthase (NOS) terminator/polyadenyiation site. 

FIG. 8 shows the results of a SDS-PAGE analysis for human lactoferrin stained 
with Coomassie blue, where lane 1 is the molecular weight marker; lanes 2 - 5 are 
purified human derived lactoferrin (Sigma, USA); lanes 6-10 are single seed extracts 
25 from homozygous transgenic lines and lane 11 is a seed extract from non-transformed 
TP-309. 

FIG. 9 shows the results of a Western blot analysis of various tissues of the 
transgenic rice plants, demonstrating the tissue specificity of rLF expression. Lane 1 is 
the molecular weight marker; lane 2 is human lactoferrin (Sigma, USA); lane 3 is an 
30 extract from leaf; lane 4 is an extract from sheath; lane 5 is an extract from root; lane 6 
is an extract from seed and lane 7 is an extract from 5-day germinated seeds. 

FIG. 10 is a bar diagram illustrating the bactericidal effect of native human 
lactoferrin ("nHLF") and purified recombinant human lactoferrin produced by transgenic 
rice ("rHLF") on growth of E. coli (EPEC) after pepsin/pancreatic treatment. 

35 
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FIG. 1 1 is a graph illustrating pH-dependent iron release by native human 
lactoferrin ("nHLF") and purified recombinant human lactoferrin produced by transgenic 
rice seeds ("rHLF"). 

FIG. 12 shows the binding and uptake of HLf to Caco-2 cells after in vitro 
5 digestion. FIG. 12 A shows the determination of Dissociation constant. FIG. 12B shows 
the number of binding sites for HLf on Caco-2 cells. FIG. 12C shows the total uptake of 
HLf and Fe to Caco-2 cells within 24 h. FIG. 12D shows degradation of HLf after uptake 
into Caco cells determined by the amount of free 125 l in the cell fractions. 

FIG. 13 shows three AAT plasmids: pAPI255 containing Gib promoter, Gib 
10 signal peptide, codon-optimized AAT gene, Nos terminator and ampicillin resistance 
gene; pAPI250 containing Gt1 promoter, Gt1 signal peptide, codon-optimized AAT 
gene, Nos terminator and ampicillin resistance gene; and pAPI282 containing Bx7 
promoter, Bx7 signal peptide, codon-optimized AAT gene, Nos terminator and ampicillin 
resistance gene. 

15 FIG. 14 shows Coomassie brilliant blue staining of aqueous phase extraction of 

transgenic rice cells expressing human AAT. Both untransformed and transgenic rice 
grains were ground with PBS. The resulting extract was spun at 14,000 rpm at 4° C for 
10 min. Supernatant was collected and loaded onto a precast SDS-PAGE gel. 
FIG. 15 shows Western blot analysis of recombinant human AAT from 

20 transgenic rice grains. The extract from transgenic rice grain was separated by SDS- 
PAGE gel and then blotted onto a filter. The identification of AAT in rice grain was 
carried out by anti-AAT antibody by Western analysis. FIG. 16 shows Coomassie 
staining (FIG. 16A) and western blot analysis (FIG. 16B) of protein from transgenic rice 
grains expressing AAT. The activity of r AAT was demonstrated by a band shift assay. 

25 AAT samples from different sources were incubated with equal moles of porcine 

pancreatic elastase (PPE) at 37°C for 15 min. Negative control for band shift assay was 
prepared with the AAT samples incubated with equal volume of PPE added. Lane M is 
molecular weight markers. Lane 1a is purified AAT from human plasma. Lane 1b is 
purified AAT from human plasma + PPE. Lane 2a is protein extract containing AAT 

30 from transgenic rice seed; Lane 2b is protein extract containing AAT from transgenic 
rice seed + PPE. Lane 3a is untransformed, seed extract. Lane 3b is untransformed 
seed extract + PPE. A shifted band was shown in lane 1 b, 2b and 3b in FIG. 16A. The 
shifted band was confirmed to contain AAT entity by Western blot in FIG. 16B. 

FIG. 17A-C are schematic representations of 3 plasmids containing the Reb 

35 coding sequence under the control of 3 different promoters. FIG. 1 7A shows the 
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globulin promoter (Gib), with the Reb gene and the Reb terminator. FIG. 17B shows the 
actin promoter (Act), with the Reb gene and the Reb terminator. FIG. 17C shows the 
native Reb promoter, with the Reb gene and the Reb terminator. 

FIG. 18A-B are schematic depictions of 2 plasmids which contain different 
5 transcription factor coding sequences under the control of the rice endosperm-specific 
glutelin promoter (Gt-1). FIG. 18A shows plasmid pGT1-BPBF (API286) containing the 
Gt1 promoter, barley prolamin box binding factor (BPBF), Nos terminator and 
kanamycin resistance gene. FIG. 18B shows pGT1-PBF (API285) containing the Gt1 
promoter, the maize prolamin box binding factor (PBF), Nos terminator and kanamycin 

10 resistance gene. 

FIG. 19 illustrates the results of an analysis for the expression of recombinant 
human lysozyme in mature seed of T 0 transgenic plants derived from progenitor cells 
transformed with constructs containing the human lysozyme gene expressed under the 
control of the Gib promoter and the Reb gene expressed under the control of its own 

15 promoter ("Native-Reb"). Seeds of 30 plants containing the Reb and lysozyme genes 
and seeds from 17 plants containing only the lysozyme gene were analyzed for 
lysozyme, with twenty individual seeds of each plant analyzed. 

FIG. 20 is a comparison of the codon-optimized epidermal growth factor 
sequence ("Egfactor") with a native epidermal growth factor sequence ("Native Gene"), 

20 aligned to show 53 codons in the mature sequences, with 27 (51%) codon changes and 
30 (19%) nucleotides changes. 

FIG. 21 is a restriction map of the 4,143 bp plasmid, API270 (pGlb-EFG v2.1), 
showing an expression cassette for epidermal growth factor ("EGF"), and containing a 
Gib promoter, a Gib signal peptide, codon optimized EGF, a Nos terminator and an 

25 ampicillin resistance selectable marker. 

FIG. 22 is a restriction map of the 3877 bp plasmid, API303 (pGM-EGF v2.1 ), 
showing an expression cassette for epidermal growth factor (EGF), and containing a 
rice Gt1 promoter, a Gt1 signal peptide, codon optimized EGF, a Nos terminator and an 
ampicillin resistance selectable marker. 

30 FIG. 23 is a Western blot analysis of recombinant human EFG ("rhEGF") in 

transgenic rice seed. Lane 1 shows a broad range of molecular weight markers. Lane 
2 shows rhEGF expressed in yeast, loaded at 125 ng. Lanes 2 to 6 show rhEGF 
expressed from different transgenic rice seeds. Lane 7 is from seeds of control 
untransformed TP 309, 

35 FIG. 24 is a comparison of the codon-optimized insulin-like growth factor I 
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sequence ("Insgfact") with a native human insulin-like growth factor I sequence ("native 
gene"), aligned to show 70 codons in the mature sequences, with 40 (57%) codon 
changes and 47 (22%) nucleotides changes. 

FIG. 25 is a restriction map of the 3928 bp plasmid, API304 (pGt1-IFG.v2.1) f 
5 showing an expression cassette for insulin-like growth factor I ("IGF"), and containing a 
rice Gt1 promoter, a Gt1 signal peptide, codon optimized IGF, a Nos terminator and an 
ampicillin resistance selectable marker. 

FIG. 26 is a restriction map of the 4194 bp plasmid, API271 (pGlb-IGF v2.1), 
showing an expression cassette for insulin-like growth factor I ("IGF"), and containing a 
10 Gib promoter, a Gib signal peptide, codon optimized IGF, a Nos terminator and an 
ampicillin resistance selectable marker. 

FIG. 27 is a Western blot analysis of recombinant human IGF-I ("rhIGF") 
expressed in transgenic rice seeds. Lane 1 shows a broad range of molecular weight 
markers. Lane 2 shows rhIGF expressed in yeast, loaded at 1 /vg. Lanes 3-9 show 
1 5 rhIGF from different transgenic seeds. Lane 10 is from seeds of control untransformed 
TP 309. 

FIG. 28 is a restriction map of the 5250 bp plasmid, API321 (pGlb-gtlsig- 
Haptocorrin v 2.1), showing an expression cassette for haptocorrin, and containing a 
Gib promoter, a Gt1 signal peptide, codon optimized haptocorrin, a Nos terminator and 
20 an ampicillin resistance selectable marker. 

FIG. 29 is a restriction map of the 4948 bp plasmid, API320 (pGtl -Haptocorrin v 
2.1), showing an expression cassette for haptocorrin, and containing a Gt1 promoter, a 
Gt1 signal peptide, codon optimized haptocorrin, a Nos terminator and an ampicillin 
resistance selectable marker. 
25 FIG. 30 is a restriction map of the 4468 bp plasmid, API292 (pGlb-kcasein v2.1), 

showing an expression cassette for kappa-casein ("k-casein"), and containing a Gib 
promoter, a Gib signal peptide, a k-casein gene, a Nos terminator and an ampicillin 
resistance selectable marker. 

FIG. 31 is a restriction map of the 4204 bp plasmid, API297 (pGT1-kaapa- 
30 Casein v2.1), showing an expression cassette for kappa-casein, and containing a Gt1 
promoter, a Gt1 signal peptide, mature kappa-casein polypeptide encoding gene, a Nos 
terminator and an ampicillin resistance selectable marker. 

FIG. 32 is a restriction map of the 4834 bp plasmid, API420 (pGt1-LAD), 
showing an expression cassette for lactahedrin, and containing a Gt1 promoter, a Gt1 
35 signal peptide, lactohedrin gene, a Nos terminator and a kanamycin resistance 
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selectable marker. 

FIG. 33 is a restriction map of the 5638 bp plasmid, API418 (pGT1-LPO~S), 
showing an expression cassette for lactoperoxidase (minus the propeptide), and 
containing a Gt1 promoter, a Gtl signal peptide, lactoperoxidase gene without the 
5 propeptide, a Nos terminator and a kanamycin resistance selectable marker. 

FIG. 34 is a restriction map of the 5801 bp plasmid, API416 (pGtl- 
lactoperoxidase), showing an expression cassette for codon optimized human 
lactoperoxidase, and containing a rice Gt1 promoter, a Gt1 signal peptide, codon 
optimized lactoperoxidase, a Nos terminator and a kanamycin resistance selectable 
1 0 marker. 

FIG. 35 is a restriction map of the 4408 bp plasmid, API230 (pBX7-Lysozyme 
v2.1.1), showing an expression cassette for codon optimized lysozyme, and containing 
a BX-7 promoter, a Gt1 signal peptide, codon optimized lysozyme gene, a Nos 
terminator and an ampicillin resistance selectable marker. 

15 FIG. 36A and B represent schematic diagrams of the map of 2 plasmids, API254 

(FIG. 36A) and API264 (FIG. 36B) containing heterologous protein coding sequences 
under the control of the rice endosperm-specific globulin promoter (Gib), the Gib signal 
peptide, and Nos terminator. API254 contains the lactoferrin coding sequence, and 
API264 contains the human lysozyme coding sequence. 

20 FIG. 37 is a restriction map of the 4271 bp plasmid, API225, showing an 

expression cassette for codon optimized lysozyme, and containing a GT-3 promoter, a 
Gtl signal peptide, codon optimized lysozyme, a Nos terminator and an ampicillin 
resistance selectable marker. 

FIG. 38 is a restriction map of the 4106 bp plasmid, API229, showing an 

25 expression cassette for codon optimized lysozyme, and containing a RP-6 promoter, a 
Gt1 signal peptide, codon optimized lysosyme, a Nos terminator and an ampicillin 
resistance selectable marker. 

FIG. 39 is a comparison of the expression of lysozyme under Gt1 or Gib 
promoter with Gt1 signal peptide or Gib signal peptide. FIG. 39A is a schematic 

30 representation of plasmid AP1 159 that contains Gt1 promoter, Gt1 signal peptide, a 
lysozyme gene and Nos terminator; plasmid API 228 that contains Gib promoter, Gt1 
signal peptide, a lysozyme gene and Nos terminator; and plasmid API264 that contains 
Gib promoter, Gib signal peptide, a lysozyme gene and Nos terminator. FIG. 39B 
shows the activities of lysozyme in lysozyme-positive seeds produced in transgenic rice 

35 plants transformed with API159, API228 and API264. The seeds from multiple lines of 
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each construct were analyzed by the lysozyme activity assay. Individual seeds from 
each plant were analyzed. Seeds lacking detectable amounts of lysozyme were 
excluded. The activities of 20-lysozyme-positive seeds per plant, including both 
hemizygous and homozygous seeds were averaged. The average activities were 
5 plotted on the chart. 

FIG. 40 shows the expression time course of human lysozyme during 
endosperm development in transgenic line. Ten spikelets were harvested at 7, 14, 21 , 
28, 35, 42 and 49 days after pollination ("DAP") and analyzed by the lysozyme activity 
assay. The dark bars were from 159-1-53-16-1. The light bars were from 264-1-92-6-1. 

10 FIG. 41 is a bar graph comparing the level of lysozyme expression in transgenic 

T1 rice seeds under 7 different promoters: Gt1, Gib, GIub-2, Bx7, Gt3, Glub-1 and Rp6. 
All constructs contained a Gt1 signal peptide. 

FIG. 42A-I are a double-stranded depiction of the DNA sequence of the rice 
(Oryza sativa) bZIP protein, designated ( M Reb M ). The gene sequence of 6.227 kb 

1 5 consists of 5 introns and 6 exons flanked by 1 .2kb of the 5* promoter and 1 .2 kb of the 3' 
region. 

FIG. 43 presents a restriction map of the rice Reb gene isolated from BAC clone 

42B9. 

FIG. 44 is a single-stranded depiction of a portion of the DNA sequence of the 

20 Gib promoter with putative Reb binding sites indicated. 

FIG. 45A presents a schematic diagram of the plasmid constructs used for 
transient assays illustrating the transactivation function of the Red gene towards the Gib 
promoter where (1) is a GUS reporter construct with GUS expressed under the control 
of the Gib promoter (Glb-GUS-Nos); (2) is a null promoter construct (Apromoter-Reb- 

25 Term) where the cells were co-bombarded with the Glb-GUS reporter gene construct; 
(3) is an Reb expression construct with Reb expressed under the control of the Gib 
promoter (Glb-Reb-Term), where the cells were co-bombarded with the G/b-GUS-Nos 
reporter gene construct; (4) is an Reb expression construct with Reb expressed under 
the control of the Actin promoter (Act-Reb-Term) where the cells were co-bombarded 

30 with the Glb-GUS-Nos reporter gene construct; and (5) is an Reb expression construct 
with Reb expressed under the control of the native Reb promoter (Native-Reb-Term), 
where the cells were co-bombarded with the Glb-GUS-Nos reporter gene construct. 

FIG. 45B illustrates the relative GUS activity as measured in transient expression 
assays using the effector/reporter combinations shown in Fig. 45A. 
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FIG. 46A presents a schematic diagram of the plasmid constructs used for gain- 
of-function analysis Reb transactivation by transient assay where (1) is a GUS reporter 
construct with GUS expressed under the control of the Gt1 promoter (Gtl-GUS-Nos); 
(2) is Gt1-GUS-Nos where the cells were co-bombarded with an expression construct 
5 which has Reb expressed under the control of the native Reb promoter (native-Reb- 
Term); (3) is a GUS reporter construct with GUS expressed under the control of the Gt1 
promoter modified to contain the Reb response sequence, UAS (Gt1-UAS-GUS-Nos); 
and (4) is modified Gt1-UAS-GUS-Nos where the cells were co-bombarded with native- 
Reb-Term. 

10 FIG. 46B illustrates the relative GUS activity as measured in transient expression 

assays using the reporter constructs shown in FIG. 46A. 

FIG. 47 is a single-stranded depiction of the DNA sequence of the rice (Oryza 

sativa) globulin promoter Gib, with putative binding sites for the 02 transcription factor 

and the prolamin box indicated in the figure. 
1 5 FIG. 48 is a single-stranded depiction of the DNA sequence of the wheat Bx7 

promoter with putative binding sites for the 02 transcription factor and the prolamin box 

indicated in the figure. 

FIG. 49 illustrates the results of transactivation of the Gib, RP6 t PG5a or Bx7 

gene promoter by 02 and PBF in rice immature endosperms, where FIG. 49A is a 
20 schematic diagram of reporter and effector piasmids used in transient expression 

assays. 

FIG. 49B presents the results of transient expression assays following particle 
bombardment of rice immature endosperms, where 02 and PBF were independently 
expressed under the control of the (ubiquitin) Ubi promoter (Ubi:02) and (UbhPBF), 

25 respectively. In addition, the Ubi:02 and UbhPBF constructs were individually co- 
bombarded with Glb/GUS/NOS, RP6/GUS/NOS, PG5a/GUS/NOS or Bx7/GUS/NOS. 
All results are given relative to the GUS/LUX ratio of the Glb/GUS/NOS, RP6/GUS/NOS, 
PG5a/GUS/NOS or Bx7/GUS/NOS construct, respectively. Error bars represents the 
standard deviation of the mean value for at least five independent particle 

30 bombardments. 

FIG. 50 illustrates the results of transactivation of the Gt1 promoter by various 
amounts of 02 or PBF effector plasmid in rice immature endosperm. FIG. 50A is a 
schematic diagram of reporter and effector constructs used in transient expression 
assays. FIG. SOB presents the results of transient expression assays following particle 

35 bombardment of rice immature endosperms where 02 and PBF were independently 



19 



WO 02/064750 



PCT/US02/04909 



expressed under the control of the (ubiquitin) Ubi promoter (Ubi:02) and (UbiPBF), 
respectively. In addition, the Ubi:02 and Ubi:PBF constructs were individually co- 
bombarded with Gt1/GUS/NOS. All results are given relative to the GUS/LUX ratio of 
Gt1/GUS/NGS construct. Error bar represents the standard deviation of the mean value 
5 for at least five independent particle bombardments. 

FIG. 51 illustrates the results of transactivation of the rice glutelin (Gt1) gene 
promoter by the 02, PBF, o2-676 and PBFm transcription factors in rice immature 
endosperm. FIG. 50A is a schematic diagram of the reporter and effector constructs 
used in the transient expression assay. FIG. SOB presents the results of transient 

1 0 expression assays following particle bombardment of rice immature endosperms, with 
02, PBF, o2-676 and PBFm expressed under the control of the 35S promoter (35S;02), 
(35S.PBF); (35S: o2-676); and (35S:PBFm), respectively. In addition, the activation 
based on co-bombard of the combination of 02 plus PBF with Gtl/GUS/NOS was 
evaluated, as was the combination of o2-676 plus PBFm with Gt1/GUS/NOS. The 

15 pAHC18 (Ubi/LUC/NOS) plasmid was used as an internal control for all experiments. 
All results are given relative to the GUS/LUX ratio of Gt1/GUS/NOS construct. Error bar 
represents the standard deviation of the mean value for at least five independent 
particle bombardments. 

FIG. 52 depicts the results of a PCR analysis of T 0 transgenic plants containing 

20 Reb and the human lysozyme gene. Fig. 52A is a diagram that shows the construct 
API266 (native-Reb), the primer positions and the size of the 522 bp amplified fragment, 
where one primer was designed based on the vector sequence and the other using the 
Reb terminator. 

FIG. 52B (Reb) is a diagram that shows the construct API264 (Glb-lys), the 
25 primer positions and the size of the 278 bp amplified fragment. The primers hybridize to 
an internal sequence of the human lysozyme gene. 

FIG. 52C presents the results of a PCR analysis of native-Reb/Glb-Lys co- 
transformed plants where arrows mark the 522 bp fragment of the Reb/vector region 
and the 278 bp fragment derived from the internal sequence of the human lysozyme 
30 gene. 

FIG 53A is a comparison of the PCR amplified 5' flanking region of the CN26 
gene (PCR-Chi26) with the sequences of Chi26 (BLYChi26AJ in GenBank (accession 
number: L34210). Asterisk. (*) represents identical nucleotide. TATA box and E-region 
[Leah, R. et ah, Plant J. 6:579-589 (1994)] are underlined. Transcriptional start site is in 
35 bold and underlined. 
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FIG. 53B is a schematic representation of the expression cassette 
(Chi26/GUS/NOS) under the control of the barley PCR-Chi26 promoter with a Nos 
terminator. The 882 bp PCR-Chi26 fragment was transcriptionally fused to the 
glucuronidase (GUS) gene. 
5 FIG. 54A shows PCR fragment polymorphism of Ltp1 promoter region among 

seven barley varieties. FIG. 54B is a comparison of the PCR-amp!ified 5' flanking 
region of the Ltp1 gene (PCR-Ltp1 ) with the sequence of Ltp1 (BLYLTP1) in GenBank 
(accession number: X60292). TATA box and E-region [Leah, R. et al. (1994)] is 
underlined. Transcriptional start site is in bold and underlined. FIG. 54C is a schematic 

1 0 diagram of the expression cassette (Ltp1/GUS/NOS) under the control of the barley 
PCR-Ltp1 promoter. The 957 bp PCR-Ltp1 fragment was transcriptionally fused to the 
-glucuronidase (GUS) gene. 

FIG. 55 is a Southern analyses of genomic DNA isolated from transgenic rice 
plants containing Ltpt/GUS/NOS or Chi26/GUS/NOS. Fig. 55A shows rice genomic 

15 DNA from CW26/GUS/NOS transgenic lines: 217-1-1, 217-5-1, 217-8-1, and 217-12-1 
(lanes 2-5), digested with Hindlll and EcoRl, and probed with a 3 kb fragment containing 
the PCR-Cft/26 promoter, GUS coding region and the NOS terminator (FIG. 53B). Lane 
1 is molecular weight (MW) marker ( /HindlW). Lane 6 represents non-transformed 
TP309. Lane 7 and 8 show five (5X) and ten (10X) copies of a genome equivalent, 

20 respectively. 

FIG. 55B Rice genomic DNA from transgenic lines: 220-1-1 , 220-2-1 , 220-11-, 
220-13-1, and 220-14-1 (lanes 2-6) was digested with Hind\\\ and EcoRI and probed with 
a 2.9 kb fragment containing the PCR-Ltp1 promoter, GUS coding region and NOS 
terminator (FIG. 54C). Lane 1 indicates the molecular weight (MW) marker ( /HindWX). 

25 Lane 7 indicates non-transformed TP309. Lane 8 and 9 show five (5X) and ten 10X) 
copies of a genome equivalent, respectively. 

FIG. 56 is a restriction map of plasmid pAPI224 (pEM221-GUS v2.1 ) that 
contains the Emp1 promoter, GUS reporter gene, Nos terminator and ampicillin 
resistance selection marker. 

30 FIG. 57 is nucleotide sequence of Emp1 from Lifts, J.C. et al., Plant Mol. Biol. 

19: 335-337 (1992). 

FIG. 58 shows a histochemical analysis of transgenic line 257-10-1 and tissue- 
specific expression of Emp1 /GUS in transgenic rice. Panel a is a section of a mature 
seed. Panel b is a section of a mature embryo. Panel c is a leaf section. Panel d is a 
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root section; Panel e is a stem cross-section. Panel f is a section of a non-transgenic 
rice seed of Taipei 309 as control. 

FIG. 59 is a dark-field photomicrograph of mature seed from transgenic line 257- 
10-1 showing restricted cellular expression of Emp1/GUS to aleurone and embryo. 
5 Panel a is a view of an embryo. Panel b is a view of a thick layer aleurone. Panel c is a 
view of a thin layer aleurone. Panel d is a view of Taipei 309 negative control embryo. 
Panel e is a view of Taipei 309 negative control thick layer aleurone. Panel f is a view of 
Taipei 309 negative control thin layer aleurone. 

10 Detailed Description of the Invention 

I. Definitions 

Unless otherwise indicated, all terms used herein have the same meaning as 
they would to one skilled in the art of the present invention. Practitioners are particularly 
directed to Sambrook et a/. (1989) Molecular Cloning: A Laboratory Manual (Second 

15 Edition), Cold Spring Harbor Press, Plainview, N.Y. and Ausubel FM et al. (1993) 
Current Protocols in Molecular Biology, John Wiley & Sons, New York, N.Y., for 
definitions and terms of the art. It is to be understood that this invention is not limited to 
the particular methodology, protocols, and reagents described, as these may vary. For 
a better understanding of the present invention, the terms used herein has the following 

20 particular meanings: 

As used herein, the term "active" refers to any biological activity associated with 
a particular polypeptide, whether directly or indirectly, and includes but is not limited to 
therapeutic, anti-infective, prophylactic or diagnostic activities. Such activity may be, for 
example, an enzymatic activity associated with human lysozyme. 

25 As used herein, the term "anti-bacterial polypeptide" refers to a polypeptide that 

has bacteriostatic or bactericidal activity. 

As used herein, the term "anti-microbial polypeptide" refers to a polypeptide that 
has anti-microbial activity, including but not limited to anti-bacterial, antiviral, anti-fungal 
and anti-parasitic activities and can include acute phase proteins, cationic anti-microbial 

30 peptides and probiotic proteins. Such anti-microbial polypeptides are capable of 

inhibiting the growth of one or more of Gram-negative bacteria, Gram-positive bacteria, 
fungi (including yeast), parasites (including planaria and nematodes) and viruses. 
Typically, such anti-microbial peptides exhibit selective biological activity against such 
microbes over eukaryotic cells. 
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As used herein, the term "Bar gene" refers to a nucleotide sequence encoding a 
phosphinothricin acetyltransferase enzyme that upon expression confers resistance to 
the herbicide glufosinate-ammonium ("Basta"). 

"Cereal adjuncts" as used herein refers to cereal grains, principally barley, wheat, 
5 rye, oats, maize, sorghum and rice, or processed whole or portions thereof, especially 
the starch fraction, which are added to the barley mash, which allows the barley 
enzymes to hydrolyze both the barley starch and the starch derived from the cereal 
adjunct. 'Transgenic cereal adjuncts" as used herein refers to transgenic cereal grains, 
principally barley, wheat, rye, oats, maize, sorghum and rice, and which is expressing a 
10 recombinant molecule in a grain part, principally the endosperm (starch) layer. 

"Conversion" as used herein refers to the process of starch hydrolysis, usually 
catalyzed by acid or enzyme action, which produces dextrose, maltose, and higher 
polysaccharides from starch. 

Chimeric gene" as defined herein refers to a nucleic acid construct for 
15 introduction into a host and may include a combination of parts of different genes of 
exogenous or autologous origin, including regulatory elements. 

As used herein, the term "expression" with respect to a polypeptide refers to a 
process by which the polypeptide is produced in a host based on the nucleic acid 
sequence of a gene. The process includes both transcription and translation. The term 
20 "expression" may also be used with respect to the generation of RNA from a DNA 
sequence. 

An "expression cassette" is a nucleic acid construct generated recombinantly or 
synthetically, that contains a series of specified nucleic acid elements that can be 
transcribed or translated to produce one or more recombinant polypeptides in a host 

25 expression system. The expression cassette can be incorporated into a plasmid or a 
viral vector, for example, to form an expression vector, or can be integrated into host 
chromosome, mitochondrialDNA, plastid DNA, virus, or nucleic acid fragment, for 
example, by particle bombardment. Typically, the expression cassette includes, among 
other sequences, a promoter, a transcription start, a translation start, a heterologous 

30 gene of interest, a translation terminator and a transcription terminator. Optionally, the 
expression cassette may contain one or more selectable markers. 

An "expression vector" refers to a vector that contains or is suitable for use with 
an expression cassette for expression of heterologous DNA or RNA in a host cell. 
Many prokaryotic and eukaryotic expression vectors are commercially available. 

35 Selection of appropriate expression vectors is within the knowledge of those having skill 
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in the art. Optionally, an expression vector may contain one or more selectable markers 
for selection of hosts that contain the expression vector. 

As used herein, the term "gene" means a DNA or RNA fragment, including 
genomic or complementary sequences, that encodes a polypeptide, and may or may 
5 not include regions preceding and following the coding region, such as 5' untranslated 
(5* UTR) or "leader 11 sequences and 3' UTR or "trailer 11 sequences, as well as intervening 
sequences (introns) between individual coding segments (exons). 

The term "grain," as used herein, refers to the fruit of a mature seed-producing 
plant and includes the seed. In the context of the present invention, the term "grain" and 
10 "seed" is used interchangeably. 

The term "introduced" in the context of inserting a nucleic acid sequence into a 
ceil, includes biological or mechanical means of introduction such as transfection, 
transformation, transduction, electroporation, and particle bombardment. Such 
introduced sequences may be incorporated into a host cell genome such as into the 
1 5 chromosome, plasmid, plastid, or mitochondrial DNA for stable expression, or in the 
form of autonomous replicon, that may be transiently expressed (for example, 
transfected mRNA). 

"Heterologous nucleic acid construct" refers to a nucleic acid molecule, typically 
for introduction into a host cell for expression purposes, that is not native to the host, 
20 although parts of the construct may be native host sequences. 

By "host cell" is meant a cell which contains an introduced vector, plasmid or 
heterologous nucleic acid construct and supports the replication, transcription and/or 
translation, that is, expression thereof. Host cells can be prokaryotic cells, such as £ 
colt\ or eukaryotic cells such as yeast or other fungi, plant, insect, amphibian, or 
25 mammalian cells. Plant host cells can be rnonocotyledenous or dicotyledenous plant 
cells, whether in tissues such as roots or leaves or in grains or seeds. 

"Malt" as used herein is a dried grain resulting from limited germination of cereal 
grains, usually barley, under controlled conditions, but other cereals can be malted as 
well. When malted, the grain produces relatively large amounts of a- and ^-amylases, 
30 which are important in the conversion of starch to dextrins and fermentable sugars. 

The basic "Malting process" consists of three steps: steeping; germination; and 
drying, as described in Hickenbottom, J.W., Cereal Foods World, 41(10): 788-710 
(1996) and QIB Technical Bulletin, Vol. V (3):1-8 (1983). The germinated malt is called 
a "green malt." Drying or kilning of the green malt stops the germination process. The 
35 end result of the malting process is a "dry malted grain" or matted barley. 
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A "Liquid malt extract" results from the dry malted grain being coarsely ground, 
water extracted and concentrated. Many enzymatic systems, such as a- and 0- 
amylases and carbohydrase, are activated during germination. After addition of water 
and during a series of time and temperature changes, the enzyme systems in the 
5 malted grain break down the endosperm cell structure and some of the starch is 

converted into fermentable sugars by these enzymes. If cereal adjuncts, usually corn 
grits, are added at this time to provide additional starch, the resulting malt syrups will be 
mellower and sweeter in flavor than the extracts. After the appropriate conditions are 
met, such as the appropriate extent of hydrolytic activity or liquefaction, the liquid 

1 0 extract, now a "wort" can be moved to an evaporator for conversion to a "malt syrup." 
The remaining grain residue, separated from the liquid extract, can be used as a high 
protein feed, such as for animal feed. 

"Malt extract" refers to viscous concentrates of water extract of dried malt. "Dry 
malt extracts" are produced by spray drying the liquid malt extracts. 

1 5 The term "milk protein" or "milk polypeptide" refers to polypeptides that are 

present in human breast milk or milk of non-human animals, including lactoferrin, 
lysozyme, lactohedrin, haptocorrin, epidermal growth factor, insulin-like growth factor I, 
kappa-casein, immunoglobulins, albumin, lactoglobulin, lactoperoxidase, alpha-1- 
antitrypsin, for example. 

20 As used herein, the terms "native" relative to a given cell, polypeptide, nucleic 

acid, trait or phenotype refers to the form in which that is typically found in nature. 

A nucleic acid is "operably linked" when it is placed into a functional relationship 
with other nucleic acid sequences. For example, DNA for a leader sequence, such as a 
signal peptide, is operably linked to heterologous gene if the combination is expressed 

25 as a preprotein that can be cleaved or directed to the appropriate cell compartment for 
cleavage to generate a desired heterologous polypeptide; a promoter or enhancer is 
operably linked to a coding sequence if it affects the transcription of the sequence; or a 
ribosome binding site is operably linked to a coding sequence if it is positioned so as to 
facilitate translation. Generally, "operably linked" means that the DNA sequences being 

30 linked are contiguous, and, in the case of a leader sequence, contiguous and in reading 
phase. However, some "operably linked' elements, e.g., enhancers, do not have to be 
contiguous. Linking may be accomplished by ligation at convenient restriction sites. If 
such sites do not exist, synthetic oligonucleotide adaptors or linkers may be used in 
accordance with conventional practice. 
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The term "Percent or % homology" is used interchangeably herein with the term 
"% identity" and refers to the ievet of nucleic acid or amino acid sequence identity 
between two or more aligned sequences, when aligned using a sequence alignment 
program. For example, 70% homology means the same thing as 70% sequence identity 
5 determined by a defined algorithm. For purposes herein, a homologue of a given 

sequence has greater than 80% sequence identity over a length of the given sequence 
and include, but are not limited to, 80%, 85%, 90% or 95% or more percent sequence 
identity to a given sequence, such as, the coding sequence for lactoferrin, as described 
herein. 

10 As used herein, a "plant cell" refers to any cell derived from a plant, including 

undifferentiated tissue (e.g., callus) as well as plant seeds, pollen, progagules and 
embryos. 

As used herein, the term "plant" includes reference to whole plants, plant tissues 
and individual plant cells, and progeny of same. Thus, the term includes, without 
15 limitation, leaves, stems, roots, shoots, endosperms, grains, seeds, embryos, 

suspension cultures, meristematic regions, callus tissue, gametophytes, sporophytes, 
pollen, progagules, and microspores. The class of plants includes higher plants 
amenable to transformation techniques, such as monocotyledenous and dicotyledenous 
plants. 

20 As used herein, a "plant cell" refers to any cell derived from a plant. 

As used herein, the term "plasmid" refers to a nucleic acid construct, typically 
circular, double-stranded or single stranded, containing DNA or RNA, used as a vector 
for introduction of foreign or heterologous nucleic acid into a host cell, and which forms 
an extrachromosomal self-replicating genetic element in host cells, such as bacteria or 

25 eukaryotic cells. 

The term "polypeptide" as used herein refers to a compound made up of a 
sequence of amino acid residues linked by peptide bonds and includes peptides, 
oligopeptides, proteins, and active fragments thereof . The term "protein" as used herein 
may be synonymous with the term "polypeptide" or may refer, in addition, to a complex of 

30 two or more polypeptides and may be in primary, secondary or tertiary configuration. 
As used herein, the term "promoter' 1 refers to a nucleic acid sequence 
functioning to direct transcription of a downstream gene. The promoter will generally be 
appropriate to the host cell in which the target gene is being expressed. The promoter, 
together with other transcriptional and translations regulatory nucleic acid sequences 

35 are termed "control sequences". Thus, for example, transcriptional and translational 
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regulatory sequences include, but are not limited to, promoter sequences, ribosomal 
binding sites, transcriptional start and stop sequences, translational start and stop 
sequences, and enhancer or activator sequences. A promoter may be a constitutive 
promoter or inducible, for example, during germination of a seed or during seed 
5 maturation. 

As used herein, the term "purifying" is used interchangeably with the term 
"isolating" and generally refers to any separation of a particular component from other 
components of the environment in which it is found or produced. For example, purifying 
a recombinant protein from plant cells in which it was produced typically means 

10 subjecting transgenic protein-containing plant material to separation techniques such as 
sedimentation, centrifugation, filtration, column chromatography. The results of any of 
such purifying or isolating steps may still contain other components as long as the 
results have less other components ("contaminating components") than before such 
purifying or isolating steps. 

15 As used herein, "recombinant" includes reference to a molecule such as nucleic 

acid or polypeptide or to a host cell containing such, where the molecules are produced 
by introduction into and expression of an introduced nucleic acid sequence in the host 
cell. Thus, for example, recombinant cells express genes that are not found in identical 
form within the native cell or express native genes that are not otherwise expressed to 

20 such a level in the native cell without deliberate human intervention. 

As used herein, the term "seed" refers to all seed components, including, for 
example, the coleoptile and leaves, radicle and coleorhiza, scutulum, starchy 
endosperm, aleurone layer, pericarp and/or testa, either during seed maturation and 
seed germination. In the context of the present invention, the term "seed" and "grain" is 

25 used interchangeably. 

As used herein "seed in a form for use as a food or food supplement" includes, 
but is not limited to, processed seed fractions such as de-hulled whole seed, flour (seed 
that has been de-hulled by milling and grounding into a powder form) a seed protein 
extract (where the protein fraction of the flour has been separated from the 

30 carbohydrate fraction) and/or a purified protein fraction derived from the transgenic 
grain. 

As used herein, the term "selectable marker-encoding nucleotide sequence" or 
"selectable marker refers to a nucleotide sequence or the product thereof that confers 
to a host ceil to which it is introduced the ability to grow in the presence of a selective 
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agent such as antibiotic resistance, including resistance to ampicillin and hygromycin, 
for example. 

As used herein, the term "sequence identity" refers to the degree of similarity or 
identity between two or more nucleic acid or amino acid sequences aligned using a 
5 sequence alignment program. Exemplary computer programs which can be used to 
determine identity between two sequences include, but are not limited to, the suite of 
BLAST programs, e.g., BLASTN, BLASTX, and TBLASTX, BLASTP and TBLASTN, 
publicly available on the Internet at http://www.ncbi.nlm.nih.gov/BLAST/ ". See, also, 
Altschul, S.F. etaL, 1990 and Altschul, S.F. et a/., 1997. 

10 Sequence searches are typically carried out using the BLASTN program when 

evaluating a given nucleic acid sequence relative to nucleic acid sequences in the 
GenBank DNA Sequences and other public databases. The BLASTX program is 
preferred for searching nucleic acid sequences which have been translated in all 
reading frames against amino acid sequences in the GenBank Protein Sequences and 

1 5 other public databases. Both BLASTN and BLASTX are run using default parameters of 
an open gap penalty of 1 1.0, and an extended gap penalty of 1.0, and utilize the 
BLOSUM-62 matrix. [See, Altschul, et ai, 1997.] 

A preferred alignment of selected sequences in order to determine "% identity" 
between two or more sequences, is performed using for example, the CLUSTAL-W 

20 program in MacVector version 6.5, operated with default parameters, including an open 
gap penalty of 10.0, an extended gap penalty of 0.1, and a BLOSUM 30 similarity 
matrix. 

A nucleic acid sequence is considered to be "selectively hybridizable" to a 
reference nucleic acid sequence if the two sequences specifically hybridize to one 

25 gnother under moderate to high stringency hybridization and wash conditions. 

Hybridization conditions are based on the melting temperature (Tm) of the nucleic acid 
binding complex or probe. For example, "maximum stringency" typically occurs at about 
Tm-5°C (5° below the Tm of the probe); "high stringency" at about 5-10° below the Tm; 
"intermediate stringency" at about 10-20° below the Tm of the probe; and "low 

30 stringency" at about 20-25° below the Tm. Functionally, maximum stringency conditions 
may be used to identify sequences having strict identity or near-strict identity with the 
hybridization probe; while high stringency conditions are used to identify sequences 
having about 80% or more sequence identity with the probe. 

Moderate and high stringency hybridization conditions are well known in the art 

35 (see, for example, Sambrook et al, 1989, Chapters 9 and 1 1 , and in Ausubel et a/., 
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1993, expressly incorporated by reference herein). An example of high stringency 
conditions includes hybridization at about 42°C in 50% formamide, 5X SSC, 5X 
Denhardfs solution, 0.5% SDS and 100 ^g/ml denatured carrier DNA followed by 
washing two times in 2X SSC and 0.5% SDS at room temperature and two additional 
times in 0.1X SSC and 0.5% SDS at 42°C. 

As used herein, the terms "transformed" or "transgenic" with reference to a host 
cell means the host cell contains a non-native or heterologous or introduced nucleic acid 
sequence that is absent from the native host cell. Further, "stably transformed" in the 
context of the present invention means that the introduced nucleic acid sequence is 
maintained through two or more generations of the host, which may be due to 
integration of the introduced sequence into the host genome. 

"Transgenic malt extract" as used herein refers to a vicious concentrate of the 
water extract of dried malt which includes a recombinant polypeptide and/or metabolite. 

"Transgenic malt syrup" as used herein refers to vicious concentrate of the water 
extract of dried 'malt' and other cereal grains which includes a recombinant polypeptide 
and/or metabolite. 

As used herein, the term "transgenic plant" refers to a plant that contains 
introduced nucleic acid sequences that are not present in the native ("untransformed") 
plant or plant cell. Thus a plant having within its cells a heterologous polynucleotide is 
referred to herein as a "transgenic plant." The heterologous polynucleotide can be 
either stably integrated into the genome, or can be extra-chromosomal. Preferably, the 
polynucleotide of the present invention i§ stably integrated into the genome such that 
the polynucleotide is passed on to successive generations. The term "transgenic" as 
used herein does not encompass the alteration of the genome (chromosomal or extra- 
chromosomal) by conventional plant breeding methods or by naturally occurring events 
such as random cross-fertilization, non-recombinant viral infection, non-recornbinant 
bacterial transformation, non-recombinant transposition, or spontaneous mutation. 
"Transgenic" is used herein to include any cell, cell line, callus, tissue, plant part or 
plant, the genotype of which has been altered by the presence of the introduced nucleic 
acids including those transgenics initially so altered as well as those created by sexual 
crosses or asexual reproduction of the initial transgenics. Thus, a transgenic or 
transformed plant cell or plant also includes progeny of such cell or plant and progeny 
produced from a breeding program employing one or more transgenic plants as parents 
in one or more crosses and exhibiting an altered phenotype resulting from the presence 
of the introduced sequence. Hence, a transformed or transgenic plant herein includes 
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any plant which has a cell containing introduced nucleic acid sequences, regardless of 
whether the sequence was introduced into the plant directly through transformation or 

by generational transfer from a progenitor cell which originally received the construct by 
direct transformation. 

5 As used herein, the term "vector" refers to a nucleic acid construct designed for 

introduction of a heterologous nucleic acid into host cells. 

II. Expression Systems, Methods and Compositions of the Present Invention 

The present invention provides for nucleic acid constructs, vectors, expression 

10 systems and methods for high level expression of recombinant polypeptides in plants, 
including monocots and dicots, and compositions containing such as well as 
compositions resulting from such expression. For example, the monocots are cereals 
including rice, wheat/barley, oats, millet, sorghum and corn; and the dicots are plants 
such as tobacco or cotton. Some of the same or similar nucleic acid constructs as 

15 made for expression of heterologous polypeptides in monocots can be used for 

expression in dicots as well. Preferably, monocot regulatory sequences are used in 
monocots and dicot regulatory sequences are used in dicots. Further genes to be 
expressed in monocots are codon-optimized for expression in monocots, and genes to 
be expressed in dicots are codon-optimized for expression in dicots. 

20 In practicing the invention, a heterologous polypeptide is produced in the seeds 

or grain of transgenic plants which express the nucleic acid coding sequence for the 
heterologous polypeptide. The transgenic grains may thereafter be orally delivered, 
such as, for example, as a tablet or capsule or an extract or malt syrup for use as a food 
additive or supplement to produce an "improved food compositions". 

25 More specifically, the present invention is based on the expression of nucleic 

acid molecules encoding heterologous polypeptides which are each linked to a signal 
peptide for directing the expressed polypeptide to the endosperm, under the control of 
one or more seed or maturation specific promoters, such as a promoter from a seed 
storage protein, with or without the addition of one or more transcription factors. 

30 The expression of heterologous polypeptides in plants pursuant to the present 

invention is exemplified herein in detail by the expression of recombinant codon 
optimized human lactoferrin (hLF), human lysozyme (hLys) and a^antitrypsin. Plasmids 
suitable for expression of other recombinant polypeptides are also exemplified. It is 
understood that other polypeptides can be expressed in a similar fashion by substituting 
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the gene in the present expression cassettes with the gene of interest, with or without 
codon optimization. 

The present invention provides food supplement compositions (also termed 
"improved food compositions" comprising heterologous polypeptides and methods of 
5 making such compositions. In practicing the invention, a heterologous polypeptide is 
produced in the seeds or grain of transgenic plants which express the nucleic acid 
coding sequences for the heterologous polypeptides and the transgenic grains added to 
a food such as an infant formula, bakery goods, etc., to result in an "improved food 
compositions". More specifically, the invention is based on the expression of 

10 heterologous polypeptides in grains or seeds of plants, such as monocot plants, as 
exemplified herein by the expression of human lactoferrin (hLF), human lysozyme and 
human ai-antitrypsin, under the control of a seed specific promoter in rice. The 
heterologous polypeptides produced by transgenic plants are compared to the native 
form of the same protein, information on the stability of the recombinant protein and the 

15 advantages of using rice, cereal or other grain containing such recombinant 

heterologous polypeptides in foods and in industrial applications, are further described 
below. 

The invention relies on the use of heterologous nucleic acid constructs including 
the coding sequences for commercially important heterologous polypeptides of 
20 nutritional, therapeutic and/or industrial value, as exemplified herein by lactoferrin and 
lysozyme. 

The improved food compositions of the invention include milk proteins such as 
lactoferrin, and lysozyme, produced in the grain of transgenic plants, which are useful for 
improved nutrition. In one preferred approach, the improved food compositions are 
25 administered to an infant. Typically the improved food compositions, e.g., infant formula 
contain one or more recombinant human milk proteins in an amount that corresponds to 
the amount and proportions of the same human milk proteins found in endogenous 
human milk. 

30 A. Lysozyme 

Human milk lysozyme, called muramidase or peptidoglycan N-acetylmuramoyl- 
hydrolase (EC 3.2.1.17) contains 130 amino acid residues and is a protein of 14.7 kDa in 
size. Human lysozyme is non-giycosylated and possesses unusual stability in vitro and 
in vivo due to its amino acid and secondary structure. 
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Lysozyrries act as enzymes th^t cleave peptidoglycans, and ubiquitous cell wall 
component of microorganisms, in particular bacteria. Specifically, lysozymes are 1 ,4- - 
acetylmuramidases that hydrolyze the glycoside bond between N-acetylmuramic acid 
and N-acetylglucosamine. Gram-positive bacteria are highly susceptible to lysozyme 
5 due to the polypeptidoglycan on the outside of the cell wall. Gram-negative strains have 
a single polypeptidoglycan layer covered by lipopolysaccharides and are therefore less 
susceptible to lysis by lysozyme, however, the sensitivity can be increased by the 
addition of EDTA (Schutte and Kula, 1990). 

The recombinant human lysozyme expressed according to the present invention 
1 0 was tested for its functional activity using an anti-microbial assay developed and 
described herein. 

B. Lactoferrin 

Lactoferrin is an iron-binding protein found in the granules of neutrophils where it 
1 5 apparently exerts an antimicrobial activity by withholding iron from ingested bacteria and 
fungi; it also occurs in many secretions and exudates (milk, tears, mucus, saliva, bile, 
etc.). In addition to its role in iron transport, lactoferrin has bacteriostatic and bactericidal 
activities, in addition to playing a role as an anti-oxidant (Satue-Gracia et a/., 2000). 

The mature lactoferrin (LF) polypeptide consists of 692 amino acids, consists of a 
20 single-chain polypeptide that is relatively resistant to proteolysis, is glycosylated at two 
sites (N138 and N478) and has a molecular weight of about 80 kD. 

C. Lactoperoxidase is an enzyme which catalyzes the conversion of hydrogen peroxide 
to water. This enzyme is found in human milk, and plays host defensive roles through 

25 antimicrobial activity. When hydrogen peroxide and thiocyanate are added to raw milk, 
the SCN's oxidized by the enzyme-hydrogen peroxide complex producing bactericidal 
compounds which destroy Gram-negative bacteria (Shin). 

D. Kappa-Casein. This group of proteins are readily digested and account for almost 
30 half of the protein content in human milk They are important as nutritional protein for 

breast-fed infants. It has also been advocated that part of the antimicrobial activity of 
human milk resides in the caseins, most likely the glycosylated kappa-casein. 

E. Alpha-1 -antitrypsin ("AAT"). AAT belongs to the class of serpin inhibitors, has a 
35 molecular mass of 52 kD, and contains about 15% carbohydrate (Carrefl et al, 1983). 
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While the binding affinity of AAT is highest for human neutrophil elastase, it also has 
affinity for pancreatic proteases such as chymotrypsin and trypsin (Beatty et a!., 1980). 

F. Lactadherin is a protective glycoprotein present in human milk that helps protect 

5 breasMed infants against infection by microorganisms. Protection against certain virus 
infections by human milk is also associated with lactadherin. (Newburg, 1999, 1998; 
Peterson; Hamosh). 

G. Epidermal Growth Factor and Insulin-like Growth, Factor- 1 are two growth factors 
10 present in human milk. These mplecules may stimulate growth and development of the 

infant gastrointestinal tract. (Murphy; Prosser). 

H. Immunoglobulins present in human act to confer resistance to a variety of pathogens 
to which the mother may have been exposed. (See, for example, Humphreys; Kortt; 

15 Larrick; Maynard; and Peeters). 

HI. Expression Vectors For Generation Of Transgenic Plants Expressing Human Milk 
Proteins 

Expression vectors for use in the present invention are chimeric nucleic acid 
20 constructs (or expression vectors or cassettes), designed for operation in plants, with 
associated upstream and downstream sequences. 

In general, expression vectors for use in practicing the invention include the 
following operably linked components that constitute a chimeric gene: (i) a seed 
maturation-specific promoter from a plant, (ii) operably linked to a leader DNA encoding 
25 a monocot seed-specific transit sequence capable of targeting a linked polypeptide to a 
seed of the plant, such as the leader sequence for targeting to a protein-storage body, 
and (iii) a heterologous polypeptide-encoding sequence. 

The chimeric gene, in turn, is typically placed in a suitable plant-transformation 
("expression") vector having (i) companion sequences upstream and/or downstream of 
30 the chimeric gene which are of plasmid or viral origin and provide necessary 

characteristics to the vector to permit the vector to move DNA from one host to another, 
such as from bacteria to a desired plant host; (ii) a selectable marker sequence; and (iii) a 
transcriptional termination region with or without a polyA tail. 

Exemplary methods for constructing chimeric genes and transformation vectors 
35 carrying the chimeric genes are given in the examples below. 
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A. Promoters 

In one aspect of this embodiment, the expression construct includes promoters 
from genes that are upregulated activity during seed maturation. Examples of such 
5 promoters include, but are not limited to the maturation-specific promoter associated 
with one of the following maturation-specific monocot storage proteins: rice glutelins, 
oryzins, and prolamines, barley hordeins, wheat giiadins and glutenins, maize zeins and 
glutelins, oat glutelins, and sorghum kafirins, millet pennisetins, rye secalins. Also 
included herein are aleurone and embryo specific promoters associated with the rice, 

10 wheat and barley genes such as lipid transfer protein Ltp1, chitinase Chi26 (Hwang, Y.S. 
et al. f Plant Cell Rep. 20: 647-654 (2001)), and Em protein Emp1 (Litts, J.C., et al., Plant 
Mol, BioL 19: 335-337 (1992)). Exemplary regulatory regions from these genes are 
exemplified by SEQ ID NOS: 15-23, as identified in the Description of the Sequences. 
In one embodiment of the present invention, a heterologous nucleic acid 

1 5 encoding a heterologous polypeptide is expressed under the control of a promoter from 
a transcription initiation region that is preferentially expressed in plant seed tissue. 
Examples of such seed preferential transcription initiation sequences include those 
derived from sequences encoding plant storage protein genes or from genes involved in 
fatty acid biosynthesis in oilseeds. Exemplary preferred promoters include a glutelin (Gt- 

20 1) promoter, as exemplified by SEQ ID NO: 18, which effects gene expression in the 
outer layer of the endosperm and a globulin (Gib) promoter, as exemplfieid by SEQ ID 
NO: 16, which effects gene expression in the center of the endosperm. Promoter 
sequences for regulating transcription of gene coding sequences operabiy linked thereto 
include naturally-occurring promoters, or regions thereof capable of directing seed- 

25 specific transcription, and hybrid promoters, which combine elements of more than one 
promoter. Methods for construction such hybrid promoters are well known in the art. 

In some cases, the promoter is derived from the same plant species as the plant 
cells into which the chimeric nucleic acid construct is to be introduced. Promoters for 
use in the invention are typically derived from cereals such as rice, barley, wheat, oat, 

30 rye, corn, millet, triticale or sorghum. 

Alternatively, a seed-specific promoter from one type of plant, such as a 
monocot, may be used to regulate transcription of a nucleic acid coding sequence from a 
different plant, whether a cereal monocot, a non-cereal monocot or a dicot. 

Numerous types of appropriate expression vectors, and suitable regulatory 

35 sequences are known in the art for a variety of plant hpst cells. The transcription 
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regulatory or promoter region is chosen to be regulated in a manner allowing for 
induction under seed-maturation conditions. Other promoters suitable for expression in 
maturing seeds include the barley endosperm-specific B1-hordein promoter (Brandt, A., 
et al., (1985), Glub-2 promoter, Bx7 promoter, Gt3 promoter, Glub~1 promoter and Rp-6 
5 promoter, particularly if these promoters are used in conjunction with transcription 

factors. The primary structure of a B1 hordein gene from barley is provided in Carlsberg 
Res. Commun. 50, 333-345). 

B. Siqnal/tarqetinq/transport Sequences 

10 In addition to encoding the protein of interest, the expression cassette or 

heterologous nucleic acid construct may encode a signal/targeting/transport peptide that 
allows processing and translocation of the protein, as appropriate. Exemplary 
signal/targeting/transport sequences, particularly for targeting proteins to intracellular 
bodies, such as vacuoles, are signal/targeting sequences associated with the monocot 

15 maturation-specific genes: glutelins, prolamines, hordeins, gliadins, glutenins, zeins, 

albumin, globulin, ADP glucose pyrophosphorylase, starch synthase, branching enzyme, 
Em, and lea. Exemplary sequences encoding a leader sequence for protein storage 
body are identified herein as SEG ID NOS: 24-30. 

In one embodiment of the present invention, the method is directed toward the 

20 localization of heterologous polypeptide expression in a given subcellular compartment 
or tissue, such as protein-storage body, aleurone layers or embryo, but also including 
other compartments such as vacuoles, chloroplasts or other plastidic compartments. 
For example, when heterologous polypeptide expressed is targeted to plastids, such as 
chloroplasts, the construct employs the use of sequences to direct the gene to the 

25 plastid. Such sequences are for example chloroplast transit peptides (CTP) or plastid 
transit peptides (PTP). In this manner, when the gene of interest is not directly inserted 
into the plastid, the expression construct additionally contains a gene encoding a transit 
peptide to direct the gene of interest to the plastid. The chloroplast transit peptides may 
be derived from the gene of interest, or may be derived from a heterologous sequence 

30 having a CTP. Such transit peptides are known in the art. See, for example, Von Heijne 
et al, 1991; Clark et al., 1989; della-Cioppa et a/., 1987; Romer et al, 1993; and Shah et 
al, 1 986. Additional transit peptides for the translocation of the protein to the 
endoplasmic reticulum (ER) (Chrispeels, K., 1991), nuclear localization signals (Raikhel, 
1992), or vacuole may also find use in the constructs of the present invention. 
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Another exemplary class of signal/targeting/transport sequences are sequences 
effective to promote secretion of heterologous protein from aleurone cells during seed 
germination, including the signal sequences associated with a~amylase, protease, 
carboxypeptidase, endoprotease, ribonuclease, DNase/RNase, (1-3)- p-glucanase, (1- 
5 3)(1-4)~ p-glucanase, esterase, acid phosphatase, pentosamine, endoxylanase, 

xylopyranosidase, arabinofuranosidase, p-giucosidase, (1-6)- (3-glucanase, peroxidase, 
and lysophospholipase. 

Since many protein storage proteins are under the control of a maturation- 
specific promoter, and this promoter is operably linked to a leader sequence for targeting 

10 to a protein body, the promoter and leader sequence can be isolated from a single 
protein-storage gene, then operably linked to a heterologous polypeptide in a chimeric 
gene construct. One exemplary promoter-leader sequence is from the rice Gt1 gene, 
having an exemplary sequence identified by SEQ ID NO: 15. Alternatively, the promoter 
and leader sequence may be derived from different genes. Another exemplary 

15 promoter/leader sequence combination is the rice Gib promoter linked to the rice Gt1 
leader sequence, as exemplified by SEQ ID NO: 16. 
C. Codon Optimization 

It has been shown that production of recombinant protein in transgenic barley 
grain was enhanced by codon optimization of the gene (Horvath et ai, 2000; Jensen ef 

20 a/. f 1996). The intent of codon optimization was to change an A or T at the third position 
of the codons of G or C. This arrangement conforms more closely with codon usage in 
typical rice genes (Huang et a/. t 1990a). 

To obtain a high expression level for heterologous polypeptides in rice cells in the 
examples herein, the polypeptide coding sequences used were codon optimized. The G 

25 + C content was thus increased from 46% to 68%. The codon-optimized lysozyme 

coding sequence for use in practicing the invention, for example, is presented as SEQ ID 
NO:l 

Similarly, in another example, for high level expression of human lactoferrin (hLF) 
in rice cells, the native hLF coding sequence was codon optimized. Out of 693 codons 
30 used in the lactoferrin gene, 413 codons were changed by one or two nucleotides. The 
amino acid sequence of LF was unchanged. The codon optimized lactoferrin coding 
sequence for use in practicing the invention is presented as SEQ ID NO:3. 

Codon optimized sequences for heterologous polypeptides exemplified herein 
are given as follows: for lactoferricin, SEQ ID NO: 7; for EG F, SEQ ID NO: 8; for IGF-1, 
35 SEQ ID NO: 9; for lactohedrin, SEQ ID NO: 10; for kappa-casein, SEQ ID NO: 11; for 
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haptocorrin, SEQ ID NO: 12; for lactoperoxidase, SEQ ID NO: 13; and for alpha-1- 
antitrypsin, SEQ ID NO: 14. 

D. Transcription Factors 
5 In one embodiment of the invention, the transgenic plant herein is also 

transformed with the coding sequence of one or more transcription factors capable of 
enhancing the expression of a maturation-specific promoter. For example, the 
embodiment involves the use of the maize Opaque 2 (02) or prolamin box binding factor 
(PBF), separately or together, or the use of rice endosperm bZip (Reb) protein as 

10 transcriptional activators herein. Exemplary sequence for these three transcription 
factors are given identified below as SEQ ID NOS: 31-33. Transcription factor 
sequences and constructs applicable to the present invention are detailed in co-owned 
PCT application No. PCT/US01/14234, International Publication number WO 01/83792 
A1, published November 8, 2001, which is incorporated herein by reference in its 

15 entireties, including the references cited therein. Reference is also made to Examples 
12-15 herein. 

Transcription factors are capable of sequence-specific interaction with a gene 
sequence or gene regulatory sequence. The interaction may be direct sequence- 
specific binding in that the transcription factor directly contacts the gene or gene 

20 regulatory sequence or indirect sequence-specific binding mediated by interaction of the 
transcription factor with other proteins. In some cases, the binding and/or effect of a 
transcription factor is influenced (in an additive, synergistic or inhibitory manner) by 
another transcription factor. The gene or gene regulatory region and transcription factor 
may be derived from the same type (e.g., species or genus) of plant or a different type of 

25 plant. The binding of a transcription factor to a gene sequence or gene regulatory 

sequence may be evaluated by a number of assays routinely employed by those of skill 
in the art, for example, sequence-specific binding may be evaluated directly using a label 
or through gel shift analysis. 

The ability of a given transcription factor to enhance the expression of a 

30 heterologous polypeptide under a given promoter may vary. The present invention 
includes use of the following exemplary combinations of transcription factors and 
promoters: Reb-GIb promoter; 02-Glb promoter; PBF-GIb; 02-PBF-Glb; 02-RP6; PBF- 
RP6; 02-PBF-RP6; 02- PG5a; PBF-PG5a; 02-PBF-PG5a; 02- Bx7; PBF-Bx7; 02- 
PBF-Bx7; 02-Gt1; PBF-Gt1 ; 02-PBF-Gt1 ; all with or without signal peptide sequences 

35 from protein body genes. 
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As detailed in the cited PCT application, the transcription factor gene is 
introduced into the plant in a chimeric gene containing asuitable promoter, preferably a 
maturation-specific seed promoter operably linked to the transcription factor gene. 
Plants may be stably transformed with a chimeric gene containing the transcription factor 
5 by methods similar to those described with respect to the milk-protein gene(s), 
exemplified herein. Plants stably transformed with both exogenous transcription 
factor(s) and heterologous polypeptide genes may be prepared by co-transforming plant 
cells or tissue with both gene constructs, selecting plant cells or tissue that have been 
co-transformed, and regenerating the transformed cells or tissue into plants. 
10 Alternatively, different plants may be separately transformed with exogenous 

transcription factor genes and milk-protein genes, then crossed to produce plant hybrids 
containing by added genes. 

E Additional Expression Vector Components 

15 Expression vectors or heterologous nucleic acid constructs designed for 

operation in plants, comprise companion sequences upstream and downstream to the 
expression cassette. The companion sequences are of plasmid or viral origin and 
provide necessary characteristics to the vector to permit the vector to move DNA from 
one host to another such as from bacteria to the plant host including, for example, 

20 sequences containing an origin of replication and a selectable marker. Typical 

secondary hosts for production of plasmids for transformation into plants include bacteria 
and yeast. 

In one embodiment, the secondary host is £. co//, the origin of replication is a 
colEi-type, and the selectable marker is a gene encoding ampicillin resistance. Such 
25 sequences are well known in the art and are commercially available as welt (e.g., 
Clontech, Palo Alto, Calif.; Stratagene, La Jolla, CA). 

The transcription termination region may be taken from a gene where it is normally 
associated with the transcriptional initiation region or may be taken from a different gene. 
Exemplary transcriptional termination regions include the NOS terminator from 
30 Agrobactehum Ti plasmid and the rice a-amylase terminator. 

Polyadenylation tails (Alber et a/., 1982) may also be added to the expression 
cassette to optimize high levels of transcription and proper transcription termination, 
respectively. Polyadenylation sequences include, but are not limited to, the Agrobactehum 
octopine synthetase signal, Gielen, et a/., 1984 or the nopaline synthase of the same 
35 species, Depicker, et al., 1 982. 
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Suitable selectable markers for selection in plant cells include, but are not limited 
to, antibiotic resistance genes, such as, kanamycin (npfll), G418, bleomycin, 
hygromycin, chloramphenicol, ampicillin, tetracycline, and the like. Additional selectable 
markers include a bar gene which codes for bialaphos resistance; a mutant EPSP 
5 synthase gene which encodes glyphosate resistance; a nitrilase gene which confers 
resistance to bromoxynil; a mutant acetolactate synthase gene (ALS) which confers 
imidazolinone or sulphonylurea resistance; and a methotrexate resistant DHFR gene. 

The particular marker gene employed is one which allows for selection of 
transformed cells as compared to cells lacking the DNA which has been introduced. 
10 Preferably, the selectable marker gene is one which facilitates selection at the tissue 
culture stage, e.g., a kanamyacin, hygromycin or ampicillin resistance gene. 

The vectors of the present invention may also be modified to include intermediate 
plant transformation plasmids that contain a region of homology to an Agrobacterium 
tumefaciens vector, a T-DNA border region from Agrobacterium tumefaciens, and 
1 5 chimeric genes or expression cassettes (described above). Further, the vectors of the 
invention may comprise a disarmed plant tumor inducing plasmid of Agrobacterium 
tumefaciens. 

In general, a selected nucleic acid sequence is inserted into an appropriate 
restriction endonuclease site or sites in the vector. Standard methods for cutting, 

20 ligating and E. coli transformation, known to those of skill in the art, are used in 

constructing vectors for use in the present invention. (See generally, Maniatis, et a/., 
Molecular Cloning: A Laboratory Manual , 2d Edition (1989); Ausubel, et a/., (c) 
1987, 1988, 1989, 1990, 1993, current protocols IN molecular BiOLQGY . John Wiley 
& Sons, New York, N.Y; and Gelvin, S.B., et a/., eds. plant molecular biology 

25 manual , (1990), all three of which are expressly incorporated by reference, herein. 

IV, Generation of Transgenic Plants 

Plant cells or tissues are transformed with expression constructs (heterologous nucleic 
acid constructs), for example, plasmid DNA, into which the gene of interest has been inserted) 
30 using a variety of standard techniques. It is preferred that the vector sequences be stably 
integrated into the host genome. 

The method used for transformation of host plant cells is not critical to the 
present invention. For commercialization of recombinant heterologous polypeptides 
expressed in accordance with the present invention, the transformation of the plant is 
35 preferably permanent, Le. by integration of the introduced expression constructs into the 



39 



WO 02/064750 



PCT/US02/04909 



host plant genome, so that the introduced constructs are passed onto successive plant 
generations. The skilled artisan will recognize that a wide variety of transformation 
techniques exist in the art, and new techniques are continually becoming available. 

Any technique that is suitable for the target host plant may be employed within 
5 the scope of the present invention. For example, the constructs can be introduced in a 
variety of forms including, but not limited to, as a strand of DNA, in a plasmid, or in an 
artificial chromosome. The introduction of the constructs into the target plant cells can 
be accomplished by a variety of techniques, including, but not limited to calcium- 
phosphate-DNA co-precipitation, electroporation, microinjection, Agrobacterium- 

10 mediated transformation, liposome-mediated transformation, protoplast fusion or 

microprojectile bombardment. The skilled artisan can refer to the literature for details 
and select suitable techniques for use in the methods of the present invention. 
Exemplary methods for plant transformation are given in Example 2. 

When Agrobacterium is used for plant cell transformation, a vector is introduced 

1 5 into the Agrobacterium host for homologous recombination with T-DNA or the Ti- or Ri- 
plasmid present in the Agrobacterium host. The Ti- or Ri-plasmid containing the T-DNA 
for recombination may be armed (capable of causing gall formation) or disarmed 
(incapable of causing gall formation), the latter being permissible, so long as the vir 
genes are present in the transformed Agrobacterium host. The armed plasmid can give 

20 a mixture of normal plant cells and gall. 

In some instances where Agrobacterium is used as the vehicle for transforming 
host plant cells, the expression or transcription construct bordered by the T-DNA border 
region(s) is inserted into a broad host range vector capable of replication in £. co// and 
Agrobacterium, examples of which are described in the literature, for example pRK2 or 

25 derivatives thereof. See, for example, Ditta et a/., 1980 and EPA 0 120 515, expressly 
incorporated by reference herein. Alternatively, one may insert the sequences to be 
expressed in plant cells into a vector containing separate replication sequences, one of 
which stabilizes the vector in E. coli t and the other in Agrobacterium See, for example, 
McBride et aL, 1990, wherein the pRiHRI (Jouanin, et a/„ 1985, origin of replication is 

30 utilized and provides for added stability of the plant expression vectors in host 
Agrobacterium cells. 

Included with the expression construct and the T-DNA is one or more selectable 
marker coding sequences which allow for selection of transformed Agrobacterium and 
transformed plant cells. A number of markers have been developed for use with plant 

35 cells, such as resistance to chloramphenicol, kanamycin, the aminoglycoside G41 8, 
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hygromycin, or the like. The particular marker employed is not essential to this 
invention, with a particular marker preferred depending on the particular host and the 
manner of construction. 

For >4gro6acter/um-mediated transformation of plant cells, explants are incubated 
5 with Agrobacterium for a time sufficient to result in infection , the bacteria killed, and the 
plant cells cultured in an appropriate selection medium. Once callus forms, shoot 
formation can be encouraged by employing the appropriate plant hormones in 
accordance with known methods and the shoots transferred to rooting medium for 
regeneration of plants. The plants may then be grown to seed and the seed used to 
10 establish repetitive generations and for isolation of the recombinant protein produced by 
the plants. 

There are a number of possible ways to obtain plant cells containing more than 
one expression construct. In one approach, plant cells are co-transformed with a first 
and second construct by inclusion of both expression constructs in a single 

1 5 transformation vector or by using separate vectors, one of which expresses desired 
genes. The second construct can be introduced into a plant that has already been 
transformed with the first expression construct, or alternatively, transformed plants, one 
having the first construct and one having the second construct, can be crossed to bring 
the constructs together in the same plant. 

20 A. Plants 

Host cells of the present invention include plant cells, both monocotyledenous 
and dicotyledenous. In one embodiment, the plants used in the methods of the present 
invention are derived from monocots, particularly the members of the taxonomic family 
known as the Gramineae. This includes all members of the grass family of which the 

25 edible varieties are known as cereals. The cereals include a wide variety of species 
such as wheat (Triticum sps.), rice (Oryza sps.) barley (Hordeum sps.) oats, (Avena 
sps.) rye (Secale sps.), corn (maize) (Zea sps.) and millet (Pennisettum sps,). In 
practicing the present invention, preferred grains are rice, wheat, maize, barley, rye, 
triticale. Also preferred are dicots exemplified by soybean (Glycine spp.). However, the 

30 heterologous expression constructs herein can be used in any monocots or dicots, 
whether edible or not, for production of any heterologous polypeptides whether for 
human or animal consumption or use, whether for health benefits or for industrial 
purposes. 

Plant cells or tissues are transformed with an expression vector comprising the 
35 coding sequence for a desired heterologous polypeptide. Transgenic plant cells obtained 
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as a result of such transformation express the coding sequence for the heterologous 
polypeptide, exemplified herein as lysozyrne, lactoferrin or ^-antitrypsin. The transgenic 
plant cells are cultured in medium containing the appropriate selection agent to identify 
and select for plant cells which express the heterologous nucleic acid sequence. After 
5 plant cells that express the heterologous nucleic acid sequence are selected, whole 
plants are regenerated from the selected transgenic plant cells. Techniques for 
regenerating whole plants from transformed plant cells are generally known in the art. 
Transgenic plant lines, e.g., rice, wheat, corn or barely, can be developed and genetic 
crosses carried out using conventional plant breeding techniques. 

10 Production of recombinant proteins in seeds, e.g., rice (Oryza sativa L) seeds, 

has the advantages that (a) high level expression makes it an economically practical 
strategy, and (b) rice is a normal part of the diet of infants and children, has good 
nutritional value and low allergenicity. Thus, the use of rice as the basis for a food 
supplement containing the heterologous polypeptide is unlikely to introduce any risk 

1 5 associated with animal pathogens and thereby eliminates the need for a high degree of 
purification, for example, when included in infant formula. 

In addition, rice is the staple food crop of more than half the world's population. 
Recent reports on the production of provitamin A (beta-Carotene) in rice seeds 
exemplifies the unmet need for value-added food crops, especially in the developing 

20 world (Ye et ai, 2000) where rice is used as major food crop. 



VI. Detecting Expression of Recombinant Heterologous Proteins 

Transformed plant cells are screened for the ability to be cultured in selective 
media having a threshold concentration of a selective agent Plant cells that grow on or 

25 in the selective media are typically transferred to a fresh supply of the same media and 
cultured again. The explants are then cultured under regeneration conditions to produce 
regenerated plant shoots. After shoots form, the shoots are transferred to a selective 
rooting medium to provide a complete plantlet. The plantlet may then be grown to 
provide seed, cuttings, or the like for propagating the transformed plants. The method 

30 provides for efficient transformation of plant cells with expression of a gene of 
autologous or heterologous origin and regeneration of transgenic plants, which can 
produce a recombinant human milk protein. 

The expression of the recombinant heterologous polypeptide may be confirmed 
using standard analytical techniques such as Western blot, ELISA, PGR, HPLC, NMR, 
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or mass spectroscopy, together with assays for a biological activity specific to the 
particular protein being expressed. 

Example 3 describes the characterization of human lysozyme produced in the 
seeds of transgenic rice plants. Analyses used to confirm that recombinant lysozyme 
5 produced in transgenic rice is essentially the same as the native form of the protein both 
in physical characteristics and biological activity included, SDS-PAGE, reverse IEF gel 
electrophoresis, Western blot analysis, enzyme linked immunosorbant assay (ELISA), 
enzymatic activity assay and bactericidal activity assay using indicator strains, 
Micrococcus luteus and E. coli strain JM1 09. 

10 Example 4 describes the characterization of human lactoferrin produced in the 

seeds of transgenic rice plants. Analyses used to confirm that recombinant lactoferrin 
produced in transgenic rice is essentially the same as the native form of the protein both 
in physical characteristics and biological activity included, Southern blot, Western blot, 
ELISA, N-Terminal Amino Acid Sequencing, analysis of glycosylation and determination 

15 of sugar content, a determination of the isoelectric point, pH dependent iron release of 
rLF, bacteriostatic activity assay of rLF using enteropathogenie E. coli as the indicator 
strain. 

Example 5 details the characterization of a t -antitrypsin produced by transgenic 
monocot plant cells, and Example 6 shows other milk proteins also produced by 
20 monocot plant transformed with the chimeric genes of the invention. 

VI I. Preparation of Seed Compositions, and Methods of their Use 

The invention provides, in one aspect, a protein-containing product composed of 
a monocot seed fluor, extract, or malt composition containing monocot seed or rnalted- 

25 seed components, one or more seed-produced, non-seed proteins in substantially 
unpurified form, and a vehicle containing the extract in a form suitable for human or 
animal use. For use in a food or feed product, the vehicle may be a capsule, binder 
components effective to tabietize the composition, a consumable liquid, or a consumable 
suspension. The vehicle may be a processed food in which the extract is mixed. Below 

30 are described methods for preparing the flour, extract, or malt compositions. 

A. Flour composition 

The flour composition is prepared by milling mature monocot plant seeds, using 
standard milling and, optionally, flour purification methods, e.g., in preparing refined 
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flour. Briefly, mature seeds are dehusked, and the dehusked seeds then ground into a 

fine flour by conventional milling equipment. 

The flour may be added to foods during food processing according to standard 

food processing methods. Preferably, the processing temperature does not lead to 
5 denaturation of the milk proteins, e.g.; above 60°-70°C. The flour may also be used 

directly, either in capsule, tabletized, or powder form, as a neutriceutical composition. 

For producing cosmetic or care products, such as toothpaste or topical creams, the flour 

may be blended with other components of the product (the vehicle). For preparing a 

surgical dressing or surgical powder, e.g., for blot clotting, where the heterologous 
10 protein is fibrinogen or other clotting factor(s), the vehicle is a surgical dressing or 

container for delivering the powder. 

One preferred flour composition contains lactoferrin and/or lysozyme. The flour 

may alternatively, or in addition, include one or more of the other human milk proteins 

such as epidermal growth factor, insulin-like growth factor-1, lactohedrin, kappa-casein, 
15 haptocorrin, lactoperoxidase, and a n -antitrypsin. Alternatively, the flour may contain 

another type of therapeutic protein, as given above, a clotting factor, an industrial 

enzyme, or other protein useful in a substantially unpurified state in a monocot-seed 

matrix. 

Flour containing two or more heterologous polypeptides may be prepared by 
20 combining flour from seeds that separately produce the different proteins, for example, 
equal amounts of a flour containing lysozyme and a flour containing lactoferrin. 
Alternatively, a multi-protein composition can be prepared as seed flour from plants, 
such as monocot plants co-transformed with chimeric genes expressing different milk 
proteins, e.g., lactoferrin and lysozyme. 

25 

B. An extract 

An extract composition is prepared by milling seeds to form a flour, extracting the 
flour with an aqueous buffered solution, and optionally, further treating the extract to 
partially concentrate the extract and/or remove unwanted components. Details of 

30 exemplary methods for producing the extract composition are given in Example 1 1 . 
Briefly, mature monocot seeds, such as rice seeds, are milled to a flour, and the flour 
then suspended in saline or in a buffer, such as Phosphate Buffered Saline ("PBS"), 
ammonium bicarbonate buffer, ammonium acetate buffer, Tris buffer or a volatile buffer 
that would evaporate upon drying. The flour suspension is incubated with shaking for a 

35 period typically between 30 minutes and 4 hours, at a temperature between 20-55°C. 
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The resulting hornogenate is clarified either by filtration or centrifugation. The clarified 
filtrate or supernatant may be further processed, for example by ultrafiltration or dialysis 
or both to remove contaminants such as lipids, sugars and salt. Finally, the material 
may dried, e.g., by lyophilization, to form a dry cake or powder. The extract has the 
5 advantage of high recombinant polypeptide yields, limiting losses associated with protein 
purification. At the same time, the recombinant heterologous polypeptides are in a form 
readily usable and available upon ingestion of the extract or food containing the extract. 

One particular advantage of the extract is the low amount of seed starch present 
in the extract. In particular, the extract may increase the concentration of recombinant 

10 protein, from about 0.5% of total soluble protein ("TSP") in conventional approaches to 
over about 25% of TSP in the extract approach. Some of the present extract approach 
even reached 40% of TSP depending on the expression level of the recombinant protein 
in the seeds. In addition, the extract approach removes starch granules, which require 
high gelling temperature, for example above about 75°C. Consequently, the extract 

15 approach provides more flexibility in processing the transgenic rice or other transgenic 
grain containing the recombinant polypeptides into food and nutritional drinks. 

The extract can be used as a nutraceutical for direct use, e.g., in capsule, 
tabletized or powder form, or as a food additive in food processing. In one embodiment, 
the extract is added to an infant milk formula, in an amount typically between 0,1 to 10 

20 percent by dry weight, preferably 1-5% by dry weight of the total formula weight. One 
preferred infant formula contains both lactoferrin and lysozyme, preferably in an amount 
between 50-200% of the amount of human lactoferrin or lysozyme, respectively, of that 
found in normal human milk. As noted above, lactoferrin is present in a concentration of 
about 1 gram /liter human milk, and lysozyme, about 0.1 /liter human milk. 

25 The extract may also be used directly, either in capsule, tabletized, or powder 

form, as a neutriceutical composition/where the product vehicle is the capsule, 
container, or binder for the composition. For producing cosmetic or care products, such 
as toothpaste or topical creams, the extract may be blended with other compoents of the 
product (the vehicel). For preparing a surgical dressing or surgical powder, e.g., for blot 

30 clotting, where the heterologous protein is fibrinogen or other clotting factor(s), the 
vehicle is a surgical dressing or container for delivering the powder. 

One preferred flour composition contains lactoferrin and/or lysozyme. The flour 
may alternatively, or in addition, include one or more of the other human milk proteins 
such as epidermal growth factor, insulin-like growth factor-1 , lactohedrin, kappa-casein, 

35 haptocorrin, lactoperoxidase, and -antitrypsin. Alternatively, the flour may contain 
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another type of therapeutic protein, as given above, a clotting factor, an industrial 
enzyme, or other protein useful in a substantially unpurified state in a monocot-seed 
matrix. 

As above, extract containing two or more heterologous polypeptides may be 
5 prepared by combining extracts from seeds that separately produce the different 
proteins, or by processing seeds from plants co-transformed with chimeric genes 
expressing different milk proteins, e.g., lactoferrin and lysozyme. 

C. Malt composition 

10 In accordance with another embodiment, the invention provides a malt extract or 

malt syrup ("malt") composition in which seed starches have been largely reduced to 
malt sugars, and the milk protein(s) are in an active, bioavailable form. A wide range of 
food and feed products, including processed foods, food supplement, sweetener, and 
feed additive may be produced by varying the types of malt used, the mashing program 

15 and the ways in which the wort is subsequently handled. If materials other than barley 
malt are used in the mash (such as starch from other grains). The resulting product is 
classified as a malt syrup. 

Malt extracts, which may have a syrupy consistency or may be powders, are made 
by mashing ground malt, usually barley malt, in conventional brewery equipment, 

20 collecting the wort and concentrating it or drying it. Modem production of food malt 
extracts and malt syrups has evolved into three basic grain stages: steeping, 
germination, and drying of the germinated seed, followed by three more steps involving 
liquefaction of the germinated grain, mashing of the germinated grain, lautering 
(filtering), and evaporation. Many variations of malt extracts or syrups are possible. 

25 Flavor, color, solids, enzymatic activity, and protein are the basic characteristics that can 
be adjusted during production to provide malts specific for given food applications. (See, 
generallypEley; Hickenbottom, 1996, 1997a, 1997b, 1983; Lake; Moore;Moe; Sfat; 
Doncheck; Briggs, 1981, 1998; and Hough), 
(i) Steeping 

30 After the barley of choice has been cleaned of foreign material, it is graded to size 

and transferred to steep tanks equipped with water inlet and outlet pipes. Compressed 
air is fed from the tank bottom for vigorous aeration and mixing for the barley/water 
mixture. When the barley has reached a water content of 43-45%, steeping is stopped, 
(ii) Germination 
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The steeped barley is moved to germination floors or rooms depending on the 
particular malt house's capabilities and allowed to germinate under controlled 
temperature, air, and moisture conditions. Total germination varies from four to seven 
days, depending on the barley type, density end use of the malt, and the controls or 
5 germination method used. All aspects of germination must be kept in constant balance 
to ensure proper kernel modification and yield. 

Many enzymatic systems are activated during germination. Two of the systems are 
the oxidative and reductive systems involved with the respiration phase. Other enzymes 
break down the endosperm cell structure, which in itself if a measure of germination rate 
10 when the pentose production is evaluated. The proteolytic enzymes release or active 
beta-amylase and also work on the proteins present to render them soluble. In fact, 
about 40% of the total protein is made soluble in water. Optimum germination activates 
a balanced enzyme system, which hydrolyzes the starch present. 

(iii) Kilning 

15 Drying or kilning, when done at the proper time and optimum degree of starch 

modification, stops the germination. The heat also catalyzes additional reactions, 
notably flavor and color development. When drying is complete, the sprouts and other 
extraneous materials are removed, and the kernels are then ready for further 
processing. 

20 

(iv) Malt Extracts And Syrups 

The malted barley (kernel) is coarsely ground in crushers and fed into mash tuns 
where it is mixed with water. During a series of time and temperature changes, some of 
the starch is converted into fermentable sugars by action of the natural alpha- and beta- 

25 amylases, better known as the diastatic system. If cereal adjuncts are to be added, 

which result in malt syrups with mellower and sweeter flavors than the extracts, they are 
added at this stage usually derived from the cereal grains, corn and rice, although 
barley, wheat, rye, millet and sorghum are sometimes used, derived from mature seeds 
that produce the desired recombinant milk proteins. 

30 Once the mash batch has achieved the correct degree of hydrolysis, it is transferred 

to lauter tuns. The lauter tun has a slotted or false bottom a few inches above the real 
bottom to allow for filtration and is also equipped with some means of agitation. During 
this extraction stage, the amyolytic enzymes liquefy additional insoluble starches, 
converting them to maltose and dextrins. At the same time, the proteolytic enzymes 

35 attach certain proteins converting them into simpler, soluble forms. After the appropriate 
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conditions have been met, the liquid phase, or wort, is drawn from the lauter tuns into 
evaporators. 

Evaporation of the wort is conducted under vacuum where it is converted into a 
syrup of about 80% solids. Depending on the temperatures used, malt extracts or 
5 syrups of high, medium, or zero enzymatic activity can be produced. Color and flavor 
also can be controlled during this stage. The finishing steps of filtering, cooling, and 
packaging complete the malt extract/syrup process. 

(v) Transgenic Malt Extract 

For a transgenic malt extract, the starting barley is a transgenic barley 
10 engineered to produce on or more human milk proteins in the endosperm either in grain 
maturation or in the malting process, or at both times. Malting and processing times and 
conditions are adjusted so that the bioactivity of the target recombinant molecules is 
preserved and the bioavailability of the target recombinant molecule is maximized. The 
resulting malt extract is either consumed directly as a concentrate that is either 
15 consumed directly as a food, or is incorporated as an ingredient in a food mixture. 
Studies conducted in support of the present invention demonstrate that recombinant 
proteins retain activity after malting for up to at least 288 hrs. 

(vi) Transgenic Malt Syrup 

For a transgenic malt syrup, the starting barley can be a non-transgenic barley, or a 

20 transgenic barley, or a mixture of both. The barley is processed as described, except 
that during the mashing process, a cereal adjunct is added in a form that it is converted 
during the mashing process with the concurrent retention and generation of 
bioavailability and bioactivity of the target recombinant molecule fond within the 
transgenic cereal adjunct. The use of a transgenic cereal adjunct enables the 

25 production in the malt syrup of the target recombinant molecule expressed in the 
transgenic grain endosperm. 

The malt extract or syrup may be used directly as a syrup, or added to processed 
foods or drinks, according to standard food processing procedures that employ grain 
extracts or syrups, e.g., for sweetening. One food is an infant formula containing 

30 between 0.1 to 10% malt (extract or syrup). The malt is also useful as a 

sweetener/nutritional additive in baby and adult foods, and nutritional drinks. 

Preferred malt extracts or syrups contain iactoferrin and/or lysozyme. The malt may 
alternatively, or in addition, include one or more of the human milk proteins such as 
epidermal growth factor, insuiin-like growth factor- 1, lactohedrin, kappa-casein, 

35 haptocorrin, lactoperoxidase, alpha- 1 -antitrypsin and immunoglobulins. Malts containing 
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other proteins, e.g., therapeutic proteins, which may be delivered orally are also 
contemplated. As above, malt containing two or more milk proteins may be prepared by 
combining or preparing malts from seeds that separately produce the different proteins, 
or by preparing a malt from the seeds of plants co-transformed with chimeric genes 
5 expressing different milk proteins, e.g., lactoferrin and lysozyme. 

D. Seed-Composition Products and Methods 

A variety of products containing a flour, extract, or malt seed composition in a 
suitable vehicle are contemplated, as given below: 

10 (i) Nutritional supplement, sweetener, food, or feed. In this general 

embodiment, the seed composition contains an orally active nutritional peptide, such as 
a milk protein, e.g., lysozyme and/or lactoferrin, in a suitable fluor, extract, or malt 
composition, or an anti-microbial protein. The composition is carried in a suitable 
vehicle which may be a capsule or binder or liquid suspension, for direct ingestion, a 

1 5 container where the composition is used as a sweetener or other additive, or a food or 
feed, where the composition is used to produce a nutritionally enhanced food or feed. 

(ii) Surgical dressing, bandages, and clotting compositions. In this embodiment, 
the seed composition contains a blood-clotting factor, such as fibrinogen in a suitable 
fluor, extract, or dried malt composition. For a bandage product, the vehicle is the 

20 bandage itself, carrying the composition in a dried form, e.g., in a bandage matrix. 
Where the composition is used directly as a clotting agent for surgical clotting, the 
vehicle is a suitable dispenser for delivering the composition in dried or liquid form to a 
surgical wound, 

(iii) Industrial enzyme or detergent product. The composition may contain an 
25 industrial enzyme, such as a protease, useful in detergent or other cleaning or 

processing products. Here the composition is preferably an extract or fluor added 
directly to the product of interest. 

(iv) Anti-infective agent. The composition may contain a seed-produced 
antimicrobial agent, such as the anti-microbial milk proteins as lactoferrin, lysozyme, 

30 lactoferricin, lactohedrin, kappa-casein, haptocorrin, or lactoperoxidase, a milk protein 
like alpha-1 -antitrypsin that may function as an antimicrobial by inhibiting proteolysis of 
other anti-microbial proteins, non-milk anti-microbial proteins and acute-phase proteins, 
e.g., proteins that are produced normally in production animals in response to infection, 
such as C-reactive protein, serum amyloid A; ferritin, haptoglobin, seromucoids, 

35 ceruloplasmin, 15-keto-13,14-dihydro-prostaglandin F2 alpha, fibrinogen, alpha-1-acid 
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glycoprotein, mannose binding protein, lipopolysaccharide binding protein, alpha-2 
macroglobulin and defensins, anti-microbial protein(s), such as cecropin, magainin, 
defensins, tachyplesin, parasin I, buforin I, PMAP-23, moronecidin, anoplin, gambicin, 
and SAMP-29, or other anti-microbial proteins, such as .CAP37, granuiysin, secretory 
5 leukocyte protease inhibitor, CAP18, ubiquicidin, bovine antimicrobial protein- 1, Ace- 
AMP1, tachyplesin, big defensin, Ac-AMP2, Ah-AMP1, and CAP18. 

The compdsition may be added to food or feed, to confer enhanced resistance to 
microorganisms, or may be used directly as a nutraceutical product, e.g., in capsule 
form, for oral administration. The composition containing the anti-microbial protein may 

10 be incorporated, typically in fluor or extract form, into cosmetics, bandages, or other 
health care products to confer anti-microbial properties to the product. 

From the foregoing, it can be appreciated how various objects and features of the 
invention are met. The production of high levels of heterologous polypeptide in grains, 
exemplified herein by rice, provides the distinct advantage that protein-containing 

15 compositions useful in a variety of products may be prepared with little or no purification. 

Transgenic seeds are ideal bioreactors, combining low production costs and low 
or minimal downstream processing costs prior to use. Seed grain proteins can 
accumulate to 9-19% of grain weight (Lasztitym 1996); the endosperm proteins are 
synthesized during grain maturation and stored in protein bodies for use in the 

20 germination and seedling growth of the next plant generation; grains can be stored for 
years without loss of functionality, and therefore the downstream processing can be 
conducted independently of growing seasons. 

The results presented herein demonstrate that human milk proteins may be 
expressed at high levels in the seeds of transgenic plants, e.g., up to 0.25 to 1% of total 

25 seed dry weight. The production of high levels of human milk proteins in grains, 

exemplified herein by rice, provides the distinct advantage that food supplements may 
be prepared with little or no purification. Since the recombinant grain finds utility as a 
food or food supplement, as a flour, extract or malt, the regulatory requirements for 
purity are not stringent. 

30 

All publications, patents and patent applications are herein expressly 
incorporated by reference in their entirety to the same extent as if each individual 
publication, patent or patent application was specifically and individually indicated to be 
incorporated by reference in its entirety. 
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The following examples illustrate but are not intended in any way to limit the 
invention. 

EXAMPLE 1 

5 Expression Vectors for Generation of Transgenic Plants 

In general, expression vectors were constructed using standard molecular 
biological techniques as described in Ausubel et a/ M 1 987. The vectors contain a 
heterologous protein coding sequence for lactoferrin or lysozyme under the control of a 
rice tissue-specific promoter, as further described below. 

10 

A. An Expression Vector For Human Lysozyme Expression In Transgenic Rice 

Cells 

The synthesized lysozyme gene was cloned into an API base vector pAPI137 by 
conventional molecular cloning techniques (Sambrook et a/., 1989). Plasmid pAPI137 

1 5 contains the RAmyZD promoter (Huang et a/., 1 993), the codons for the RAmy3D signal 
peptide and the RAmy3D terminator. The RAmy3D promoter, isolated from the rice 
amylase gene family, is activated in rice calli by sugar starvation (Huang et a/., 1993). 
The human lysozyme gene was placed between the sequences of the RAmy3D signal 
peptide and the RAmy3D terminator to give plasmid pAPI156 having a size of 4829 bp. 

20 The promoter of the rice Glutelin 1 gene (Gt-1) and the nucleotide sequence of 

the signal peptide were cloned with two primers based on the published Gt1 gene 
sequence (Okita et at. J Biol Chem 264: 12573-12581, 1989). The forward primer with 
HindlW site was named, MV-GM-F1 ; 5 f - 

ATCGAAGCTTCATGAGTAATGTGTGAGCATTATGGGACCACG-3 , (SEQ ID NO:5). 

25 The reverse primer was named Xba-Gf-f-R1 ; 5- 

CTAGTCTAGACTCGAGCCACGGCCATGGGGCCGGCTAGGGAGCCATCGCACAAGA 
GGAA-3' (SEQ ID NO:6). Genomic DNA was isolated from leaves of rice variety M202 
(Dellaporta et al., 1983). The PGR product amplified from the genomic DNA was cloned 
into pCR 2.1 (Invitrogen, Carlsbad, CA). The resulting plasmid was named pCRGM or 

30 pAPI134. 

To generate a Gt-1 expression plasmid, pAPI134 was digested with HindlW and 
Xbal. The fragment containing the Gt-1 promoter and Gt-1 signal peptide was cloned 
into a pUC19 based plasmid containing the nopaline synthase 3 1 (nos) terminator. The 
resulting plasmid was named pAPI141 and contains the rice Gt-1 promoter, the Gt-1 
35 signal peptide, a multiple cloning site and the nos terminator. 
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The synthesized human lysozyme gene "lys-ger* (by Operon Technologies, Inc., 
Alameda, CA) that was optimized based on the rice gene codon usage was digested 
with Dra\ and Xho\ and cloned into pAFI141 digested with A/ael and Xho\ according to 
standard cloning techniques (Sambrook et aL, 1989). The resulting plasmid was called 
5 pAPI159 (Fig. 1) having a size of 41 31 bp. 

B. An Expression Vector For Human Lactoferrin Expression In Transgenic Rice 
The hLF gene (Rey, MW, 1990) was codon optimized and synthesized by 
Operon Technologies (CA, USA). The plasmid containing the codon-optimized gene 
was called Lac-ger. Lac-ger was digested with Sma\/Xho\ and the fragment containing 
10 the lactoferrin gene was cloned into pAPI141 which was partially digested with Nae\ and 
completely digested with XhoL The resulting plasmid was named pAPl 164. For 
expression of hLF in rice seeds, the codon optimized gene was operably linked to the 
rice endosperm specific giutelin (Gt1) promoter and NOS terminator (Fig. 7). 

15 EXAMPLE 2 

Generation Of Transgenic Plant Cells Expressing Human Milk Proteins 
The procedure of microprojectile-mediated rice transformation (US Patent 
6,284,956) was followed. Calli was raised from TP309 mature rice seeds, with calii two 
to four mm in diameter selected and placed on N6 media supplemented with 0.3 M 

20 mannitol and 0.3 M sorbitol for 20 hours before bombardment. Biolistic bombardment 
was carried out with the biolistic PDC-1000/He system (Bio-Rad, USA). Plasmid 
carrying milk protein genes and pAPI76, a plasmid carrying hygromycin selectable 
marker gene were gold-coated and co-bombarded at a ratio of 6:1 with a helium 
pressure of 1 100psi. Two day old bombarded calli were then transferred to N6 selection 

25 media supplemented with 20 mg/l hygromycin B and allowed to grow in the dark at 26°C 
for 45 days. 

In order to develop transgenic rice plants, the selected calli were transferred to 
pre-regeneration and regeneration media. When regenerated plants became 1-3 cm in 
height, the plantlets were transferred to rooting media which consisted of half 
30 concentration of MS and 0.05 mg/l NAA. After two weeks, plantlets with developed roots 
and shoots were transferred to soil and kept under the cover of plastic container for a 
week. The plants were allowed to grow about 12 cm tall and shifted to the green house 
where they were grown up to maturity. 

35 
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A. A Plasmid for Hyqromvcin Selection 

Plasmid, pAPI76, carrying the bacterial hygromycin B phosphotransferase (hpt) 
gene was made as described in U.S. Patent No. 6,284,956, and was used for co- 
transformation of calli to allow selection of the transformants. Briefly, pAPI176 was 
5 created as follows: A DNA fragment was amplified from a rice a-amylase gene, 
RArnyl A (Huang et a/;, 1990b) and cloned into pBluescript KS+ at the Small EcoRl 
restriction sites and the resulting plasmid was called p1 AT. The PCR amplified 
fragment contained 297 bp of RAmyl A terminator. A SamHI DNA fragment from pGL2 
(Shimamoto et a/., 1989) was cloned into the SamHI site of p1 AT and the resulting 
10 plasmid was called pAPI174. Finally a Sac\/Xba\ fragment amplified from glucanase 
gene, Gns9 (Romero et a/ M 1998) was inserted into pAPI174 using the same restriction 
sites. The PCR generated, Gns9 promoter fragment was confirmed by DNA 
sequencing. The resulting plasmid was named as pAPI176. 

15 B. Generation of Human Lvsozyme Expressing Transgenic Rice Cells and 

Plants 

The synthetic human lysozyme (hLys) gene under the control of the RAmySD 
promoter and terminator in the pAPI156 plasmid (example 1A) was used to generate 
sixty independent transformants by particle bombardment-mediated transformation. 

20 Particle bombardment mediated transformation of rice was carried out as 

described above. Briefly, rice calli derived from TP309 were bombarded with gold 
particles coated with plasmids pAPI156 and pAPI76 in a ratio of 6:1 using the helium 
biolistic particle delivery system, PDS 1000 (Bio-Rad, CA). Transformed calli were 
selected in the presence of hygromycin B (35 mg/L) on N6 (Sigma, MO). 

25 Selected cell lines were maintained in culture media with 3% sucrose (Huang et 

aL, 1993). Lysozyme expression was induced by sugar starvation. Briefly, AA medium 
(containing 3% sucrose) was removed by aspiration, followed by washing the cells three 
times with AA minus sucrose (AA-S). The cells were then incubated with AA-S at 40% 
(v/v) density for three and a half days to obtain the optimal level of lysozyme expression. 

30 Transformants expressing lysozyme were identified by immunoblot analysis, 

turbidimetric rate determination with Micrococcus lysodeikticus or ELISA. Calli were 
ranked according to the expressed lysozyme level. Suspension cell cultures from the 
top lines were established following the procedure described previously (Huang et a/., 
1993). The amount of total protein (Bradford assay) and lysozyme (ELISA) was 

35 evaluated in selected calli (Table 1 ). 
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Table 1 . Expression Level Of Human Milk Lvsozvme In Transformed Calli 



l Cell line 


Calli (g) 


Total protein (pg) 


Lysozyme (pg) 


Lysozyme/protein (%) 


156-1 


0.39 


2626.5 


65.7 


2.5 


156-5 


0.38 


5510 


68.9 


1.25 


156-16 


0.4 


4815 


120.4 


2.5 


156-19 


0.44 


2440 


30.5 


1.25 


156-28 


0.49 


4910 


24.6 


0.5 


156-43 


0.56 


8150 


101.9 


1.25 


156-47 


0.37 


2472 


6.2 


0.25 



The synthetic human lysozyme (hLys) gene under the control of the Gt1 



5 promoter and Nos terminator in the pAPI159 plasmid (Fig. 1) was used to generate 
independent transformants by particle bombardment-mediated transformation. 
Transformed calli were selected as described above, then transferred to pre- 
regeneration and regeneration media. When regenerated plants became 1-3 cm in 
height, the plantlets were transferred to rooting media which consisted of half 

10 concentration of MS and 0.05 mg/l NAA. After two weeks, plantlets with developed roots 
and shoots were transferred to soil and kept under the cover of plastic container for a 
week. The plants were allowed to grow about 12 cm tall and shifted to the green house 
where they were grown up to maturity (R0 plants). 

Screening for R0 plants expressing human lysozyme. Individual rice 

15 endosperms or grains were ground with cold phosphate buffered-saline (PBS) with the 
addition of 0.35 M NaCI. Grinding was conducted with a pre-cooled mortar and pestle at 
1 ml buffer/grain. Clear grain homogenate was obtained by subjecting the resulting 
grain extract to centrifugation at 14,000 rpm for 10 min at 40°C. 

Embryos from individual R t seed (derived from Rq plants) that showed a level of 

20 lysozyme expression that was greater than 10 pg/seed were saved and used to 

generate R1 plants. Briefly, seeds were dissected into embryo and endosperm portions. 
The endosperm was ground and assayed for lysozyme expression (as further described 
below). Embryos were sterilized in 50% commercial bleach for 25 minutes and washed 
with sterile H 2 0 three times for 5 minutes each. Sterilized embryos were placed in a 

25 tissue culture tube that contained MS solid medium. Embryos germinated and plantlets 
having about three inches shoots and healthy root systems were obtained in two weeks. 
The plantlets were then transferred to pots to obtain mature plants (Ri). 
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A total of 197 embryos from 12 selected R0 plants were germinated and 157 R<| 
seedlings planted in the greenhouse for generation of R2 grains. Individual R2 grains 
(n=1502) from 109 R1 fertile plants were screened for lysozyrne expression by iysozyme 
activity assay in order to identify 42 homozygous plants. 
5 Homozygous R1 plants were identified by analyzing positive expressions of 

recombinant human Iysozyme (rHIys) from a minimum of 20 individual R2 grains, 
Homozygous lines derived from these plants were planted in a rice field in California. 
During growth, agronomic characteristics of both transgenic and non-transgenic plants, 
such as plant height, percentage of fertility, number of effective tillers, filled 

10 grains/panicle, non-filled grains/plant, time to maturity and 1000 grain weight were 

determined and compared. Plants with satisfactory agronomic traits were selected and 
rHIys expression levels were determined by Iysozyme activity assay. Plants that met the 
criteria for satisfactory agronomic traits and had more than 35 pg of rHIys/grain were 
advanced to next generations. 

15 SDS-PAGE, electroblotting and Western blot analysis were carried out with 18% 

precast gel (Invitrogen, Carlsbad, CA) as described in Example 3. The primary rabbit 
polyclonal antibody against human Iysozyme was purchased from Dako A/S (Denmark) 
and used at 1:5000. Lysozyrne was quantified by a turbidimetric activity assay with 
Micrococcus luteus (Sigma) on 96-well microtiter plate as described in Example 3. 

20 Briefly, 250 pi of 0.01 5% M. luteus cell suspension was incubated with 1 0 pi of samples 
containing Iysozyme with a concentration less than 2.4 Mg/ml. The reaction was followed 
by the kinetic mode in Microplate Manager (Bio-Rad, CA) for 5 min at 450 nm. The 
concentration of iysozyme was then determined in reference to the standard curve. 

The stable expression level of human Iysozyme (rHIys) reached at least about 

25 0.6% rHIys per brown rice weight amounting to 45% of the total soluble protein extract 
from rice grain. Figure 2 illustrates the seed specific expression of human Iysozyme in 
transgenic plants. rHIys is only found in mature and germinated grain, but not in any 
other tissues tested. Fig. 6 shows the expression level of human Iysozyme in powdered 
R3 seeds taken from transgenic rice plants 

30 

C. Generation of Human Lvsozvme-Expressinq Transgenic Wheat Cells and 

Plants 

Plasmid AP1 159 (Figure 1) and API230 (Figure 35) were used to transform wheat 
cells substantially in the same manner as in transforming rice cells. Eight transgenic 
35 wheat lines were produced with API159, generating an expression level of about 150 to 
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300 /yg of lysozyme per grain. Two transgenic wheat lines were produced with API230, 
yielding an expression level of about 50 to 120 j/g of lysozyme per grain. 

D. Generation of Human Lvsozyme-Expressinq Transgenic Barlev Cells and 

5 Plants 

The plasmid AP1 1 59 was also used to transform barley cells substantially as 
described as transformation of rice cells. Five transgenic barley lines were produced, 
yielding about 3.9 to 12.3 //g of lysozyme per grain. 

10 E. Generation of Human Lactoferrin Expressing Transgenic Rice Cells and 

Plants 

The synthetic human lactoferrin gene under the control of the Gt1 promoter in the 
pAPI164 plasmid was used to generate over 100 independent transformants by particle 
bombardment-mediated transformation. 

15 Particle bombardment mediated transformation of rice was carried out as 

described above. At least 20 R1 grains from each R0 plant were analyzed for rHLF 
expression. Individual R1 grains were cut into halves. The endospermic half was 
subjected to rHLF expression analysis by Western blot or ELISA and the corresponding 
positive embryonic half was germinated to generate R1 seedlings. The seedlings were 

20 transplanted to generate R2 grains. During the screening of R1 grains we observed that 
all the positive grains were opaque-pinkish in color in comparison to negative or control 
grains. The opaque-pinkish color in rice grains was then used to identify homozygous 
lines. A transgenic plant was considered to be homozygous and expressing rHLF if all 
grains from that plant were opaque-pinkish. Homozygous lines were then confirmed by 

25 ELISA analysis. Based on the expression analysis and agronomic characters, selected 
homozygous R2 lines were advanced to R3 and R6 generations. 

EXAMPLE 3 

Characterization of Recombinant Human Lysozyme (rLvs) Produced by Transgenic Rice 
30 Cells and Plants 

A. Southern blot analysis 

About three grams of young leaves were collected and grounded with liquid 
nitrogen into a fine powder. The genomic DNA was isolated according to the procedure 
as described in Dellaporta et a/., 1983, and purified by phenol-chloroform extraction. 
35 Approximately 5 pg of DNA was then with Hind\l\ and EcoRI, separated on a 1 % 
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agarose gel, blotted onto a Hybond + membrane (Amersham Pharmacia Biotech, 
Piscataway, NJ). The blot was probed with gel purified human Hlys gene and developed 
by ECL™ direct nucleic acid labeling and detection system (Amersham Pharmacia). 
By comparing to known amounts of the intact 1470 bp human lysozyme (Hlys) gene, the 
5 intact copy number of the transgenes, including promoter and Hlys gene, was estimated 
to vary from about 1 to about 6. No positive correlation between copy number of the 
rHIys transgene and amount of rHIys synthesized was discernible. 

B. SDS-PAGE and reverse IEF gel electrophoresis 

10 Induced calli or harvested cells from suspension cell cultures were ground with 

cold phosphate buffered-saline (PBS) with a protease inhibitor cocktail (2 pg/ml 
aprotonin, 0.5 pg/ml leupeptin, 1 mM EDTA and 2 mM Pefabloc). The protease inhibitor 
cocktail was excluded from the buffer used subsequently during the purification of the 
enzyme, since the inhibitors did not increase the lysozyme expression yield. Grinding 

15 was conducted with a pre-chilled mortar and pestle at approximately 2 ml buffer/g calli or 
cells. A clear homogenate was obtained by subjecting the resulting extract to 
centrifugation at 16,000 x g for 10 minutes at 4 C. 

SDS-PAGE was carried out using an 18% precast gel (Novex, CA). The resulting 
gel was stained with 0.1% Coomassie brilliant blue R-250 at 45% methanol and 10% 

20 glacial acetic acid for three hours. Gel destaining was conducted with 45% methanol 
and 10% glacial acetic acid until the desired background was reached. 

Reverse IEF gel electrophoresis was parried out using a precast Novex pH 3-10 
IEF gel according to the manufacturer's instructions (Novex, CA). About 30 pg of 
lysozyme was loaded onto the gel and electrophoresed at 100 V for 50 minutes followed 

25 by application of 200 V for 20 minutes. The gel was then fixed in 136 mM sulphosalicylic 
acid and 1 1 .5% TCA for 30 minutes and stained in 0.1% Coomassie brilliant blue R-250, 
40% ethanol, 10% glacial acetic acid for 30 minutes. The destaining solution contained 
25% ethanol and 8% acetic acid. 

30 C. Western blot analysts 

A SDS-PAGE gel was electroblotted to a 0.45 pm nitrocellulose membrane using 
a Mini Trans-Blot Electrophoretic Transfer Cell (Bio-Rad, CA) and subsequently 
subjected to immuno-blotting analysis. The blot was blocked with 5% non-fat dry milk in 
PBS, pH 7.4 for at least two hours followed by three washes with PBS, pH 7.4 for 10 

35 minutes each. The primary rabbit polyclonal antibody against human lysozyme (Dako 
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A/S, Denmark) was diluted at 1:2000 in the blocking buffer and the blot was incubated in 
the solution for at least one hour The blot was then washed with PBS three times for 10 
minutes each. The secondary goat anti-rabbit IgG (H+L)-alkaline phosphatase 
conjugate (Bio-Rad, CA) was diluted in the blocking buffer at 1:4000. The membrane 
5 was then incubated in the secondary antibody solution for one hour and then washed 
three times. Color development was initiated by adding the substrate system BCIP-NBT 
(Sigma) and the process was stopped by rinsing the blot with H2O once the desirable 

intensity of the bands had been achieved. 

10 D. Enzyme Linked Immunosorbant Assay (ELISA) 

An indirect sandwich ELISA was developed to quantify total lysozyme expressed 
in rice calli or cells and used as an alternative assay to determine the lysozyme 
expression yield. A direct sandwich ELISA for lysozyme quantification has been 
previously reported (Lollike et ai, 1995, Taylor, 1992), however an alternate assay was 

15 developed as a key reagent used in the assay is no longer commercially available. 

In carrying out the assay, rabbit anti-human lysozyme antibody (Dako D/K, 
Denmark) was used to coat a 96 well plate at 1:5000 diluted in PBS overnight at room 
temperature. After washing with PBS, the plate was blocked with 5% normal donkey 
serum (Jackson ImmunoResearch Laboratories, PA) in PBS for one hour. The plate 

20 was washed again with PBS. Lysozyme samples were diluted in 0.05% Tween in PBS 
and captured by adding to the plate and incubating for one hour. After washing the plate 
with PBS, sheep anti-human lysozyme at 1:1000 diluted with 0.05% Tween in PBS was 
added and incubated for one hour. The plate was washed again with PBS. Peroxidase- 
conjugated affinipure donkey anti-sheep IgG (H+L) diluted in 0.05% Tween in PBS at 

25 1 :10,000 was added and incubated for one hour. After a final wash of the plate with 
PBS, color was developed by incubating the plate with TMB substrate (Sigma, MO) for 
5-15 minutes and the absorbance read at 655 nm. 

E. Enzymatic Activity Assay For Lysozyme 
30 A reliable and quantitative method was developed to analyze the expression 

level of enzymatically active lysozyme. The turbidimetric assay was developed using a 
96-well microtiter plate format and based on the standard lysozyme assay that is carried 
out spectrophotometrically in cuvettes. A microtiter plate based method previously 
described for the detection of lysozyme release from human neutrophils had a detection 
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range of 1-100 ng/mi (Moreira-Ludewig et a/., 1992). The assay conditions were 
modified to maintain the linearity of detection up to 3.0 yg/ml. 

The enzymatic activity of lysozyme was routinely determined by 
spectrophotometric monitoring of the decrease in turbidity at 450 nm of a suspension of 
5 Micrococcus luteus(M, lysodeikticus) cells (Shugar, 1952). Specifically, 250 pi of a 
0.015% (w/v) Micrococcus leteus cell suspension was prepared in 66 mM potassium 
phosphate, pH 6.24 (buffer A). Cell suspensions were equilibrated at room temperature 
and the reaction was initiated by adding 10 pi samples containing lysozyme with 
concentrations from 0 to 2.4 pg/ml. Lysozyme activity was determined in a kinetic mode 

10 for 5 minutes at 450 nm. The concentration of lysozyme was then calculated by 
reference to the standard curve constructed with human milk-derived lysozyme. 

The enzymatic activity of human milk lysozyme and the rice cell derived 
lysozyme of the invention was compared. As shown in Figure 4, the lysozyme effected 
reduction of the turbidity of Micrococcus leteus cell suspensions at 450 nm was very 

15 similar for lysozyme from the two sources, while buffer alone did not have any effect on 
the reduction of turbidity. 

Three selected suspension cell culture lines were induced to express lysozyme 
and the yield estimated in parallel by ELISA and the enzymatic activity assay described 
above (Table 2). T-test analysis showed that there was no significant difference between 

20 the lysozyme concentration measured by ELISA and enzymatic activity assay (p < 0.05). 
These results demonstrate that active recombinant human milk lysozyme is synthesized 
and maintained in rice callus cells and can be isolated without losing its activity. 

Table 2. Comparison of Lysozyme Yields Estimates by Enzymatic Activity Assay and 
25 ELISA 



Cell line 


Lysozyme yield by enzymatic activity 
assay (lysozy me/total protein yglmg) 


Lysozyme yield by ELISA 
(lysozyme/total protein pg/mg) 


156-5 


25.8 +/- 6.3 


30.3 +/-3.9 


156-16 


32.1 +A5.7 


32.9 +/- 3.2 


156-31 


47.0 +/- 6.2 


42.3 +/- 7.0 



F. Recombinant Human Lvsozvme Has Bactericidal Function 
The sensitive lysis of Micrococcus luteus cells in a turbidimetric assay (Fig. 4) 
30 indicates that recombinant human lysozyme possesses enzymatic activity and functions 
as a bactericide. To confirm this with a gram-negative bacterium, a bactericidal assay 
was carried out using an E.coti strain (JM109) as a test organism (Fig. 3). 
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In carrying out the assay, an aliquot of overnight JM109 culture was grown in LB 
medium until mid log phase. A standard innoculum of mid-log phase JM109 at 2 x 10 5 
CFU (colony forming units )/ml was used in the bactericidal assay. Buffer (20 mM 
Sodium phosphate, pH 7.0, 0.5 mM EDTA) alone, buffer containing human milk 
5 lysozyme or rice seed derived lysozyme at about 30 |jg/ml were sterilized by filtration. 
The mixture of cells and lysozyme solution was then incubated at 37° C for the specified 
length of time. One-fifth of the mixture volume was plated onto the LB agar plates and 
incubated overnight at 37 C in order to determine the number of colony forming units. At 
the concentration of 30 pg/ml, recombinant human lysozyme exhibited a similar 
10 bactericidal effect as lysozyme from human milk. There was no reduction of colony 
forming units using an extract from the non-transgenic control. 

G. Purification of Lysozyme From Rice Calli, Suspension Cultures And 
Transgenic Rice Grains 

1 5 Five rice calli lines expressing high levels of lysozyme were propagated and 

induced by sucrose starvation. The calli or cells were ground by a Tissuemizer in 
extraction buffer (PBS, 0.35 M NaCI) at 2 ml buffer/g of wet calli. The resulting tissue 
homogenate was centrifuged at 25,000 x g for 30 minutes at 4 C. The supernatant was 
removed and subjected to filtration through a pre-filter and then through a 0.45 pm 

20 nitrocellulose filter. 

Approximately 1 liter of filtered supernatant from 500 grams of induced wet calli 
were then dialyzed against 50 mM sodium phosphate, pH 8.5 at 4 C overnight. The 
supernatant was loaded onto a 200 ml SP Sepharose fast flow column (XK26/40, 
Pharmacia) equilibrated with the loading buffer (50 mM sodium phosphate, pH 8.5) at a 

25 flow rate of four ml/min. The column was then washed with the same buffer until a 
baseline of A280 was achieved. Lysozyme was eluted by 0.2 M NaCI in the loading 
buffer and fractions containing lysozyme activity were pooled, concentrated and 
reapplied to a Sephacryl-100 column equilibrated and run with PBS at a flow rate of one 
ml/min. Proteins were eluted and separated by using PBS at a flow rate of one ml/min. 

30 Pure lysozyme fractions were identified by activity assay and total protein assay 
(Bradford) and the purity of lysozyme was confirmed by SDS-PAGE. 

The five lines with the highest lysozyme expression level were selected and 
propagated continuously in petri dishes or shake flasks for lysozyme isolation and 
purification. A crude extract from rice callus contains both recombinant human lysozyme 

35 and large amounts of native rice proteins. Since the calculated pi of lysozyme is 
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approximately 1 1 , a strong cation exchange column, SP-Sepharose fast flow 
(Pharmacia), was chosen as the first column to separate the rice proteins from 
recombinant human lysozyme. Most of the rice proteins did not bind to the column when 
equilibrated with 50 mM sodium phosphate, pH 8.5. The recombinant human lysozyme, 
5 on the other hand, bound to the column and was eluted by 0.2 M NaCI. Rice proteins 
that co-eluted with recombinant human lysozyme, were separated from lysozyme by gel 
filtration through a Sephacryl S-100 column and highly purified recombinant human 
lysozyme was obtained. 

To purify human lysozyme from rice grains, R 2 rice seeds from transgenic plants 

10 were dehusked and milled to flour using conventional methods. Lysozyme was 

extracted by mixing the rice flour with 0.35 N NaCI in PBS at 100 grams/liter at room 
temperature for one hour. The resulting mixture was subjected to filtration through 3 pm 
of a pleated capsule, then through 1 .2 pm of a serum capsule and finally through a 
Suporcap 50 capsule with a 0.8 pm glass filter on top of 0.45 pm filter (Pall, Ml). 

15 The clear rice extract (1 liter) was then dialyzed against 50 mm sodium 

phosphate, pH 8.5 at 4°C overnight and the dialyzed sample was loaded onto a cation 
exchange resin SP-Sepharose (Pharmacia Amersham), which was pre-conditioned with 
50 mm sodium phosphate, pH 8.5 before loading. After loading, the column was 
washed with the same buffer until a base line A280 reading was achieved, then 

20 lysozyme was eluted with 0.2 N NaCI in 50 mm sodium phosphate, pH 8.5. Fractions 
containing lysozyme were pooled and reapplied to a Sephacryl S-100 column (Bio-Rad; 
equilibrated and run with PBS). Pure lysozyme was fractions were identified by 
enzymatic assay and total protein assay (Bradford). Finally the purity of lysozyme was 
confirmed by SDS-PAGE. 

25 

H. Attributes Of Recombinant Human Lysozyme Produced In Rice 
(i). N-Terminal Amino Acid Sequencing 

Recombinant human lysozyme (rLys) isolated from rice cells as described above, 
was separated by 18% SDS-PAGE followed by electroblotting to a PVDF membrane 

30 (Bio-Rad, CA). The lysozyme band was identified by staining the membrane with 0.1% 
Coomassie Brilliant Blue R-250 in 40% methanol and 1% glacial acetic acid for 1 minute. 
The stained PVDF membrane was immediately destained in 50% methanol until the 
band was clearly visible. After the blot was thoroughly washed with H2O and air-dried, it 
was sequenced with a sequencer ABI 477 by Edman degradation chemistry at the 

35 Protein Structure Laboratory of the University of California at Davis. The results showed 



WO 02/064750 PCT/US02/04909 

that the rLys produced in transgenic rice seed had an identical N-terminal sequences to 

the human lysozyme, as follows: 

Recombinant Lys — LysVaLPheGluArg( )GIuLeuAIaArgThr 

Human Lys— — -LysValPheGluArgCysGluLeuAlaArgThr 

5 The blank parenthesis in recombinant lysozyme represents residue Cys which 

cannot be detected by the machine. This cycle was not defined, and could be due to the 

un~modified cysteine residue which cannot form a stable derivative in Edman 

degradation analysis. 

Additionally, a number of structural and functional attributes of human lysozyme 
10 and recombinant lysozyme produced in rice were found to be the same, including 

molecular weight, pi, bactericidal effect with E. coli, thermal and pH stability and specific 

activity. 

(ii). Thermal and pH stability of lysozyme 

For biotechnological applications of the recombinant human lysozyme, its 

15 thermal and pH stability as well as its resistance to proteases is of decisive importance. 
A human lysozyme standard and lysozyme from rice were diluted to a final concentration 
of 50 pg/ml in PBS and subjected to the following thermal treatment in a sequential 
mode: (1 ): 62 °C for 15 minutes; (2): 72 C for 20 seconds; (3): 85 C for 3 minutes and 
finally; (4): 100 C for about 8 to about 20 seconds. Studies were conducted with 100 pi 

20 per tube and repeated three times. Aliquots were saved at the end of each treatment 
and the remaining lysozyme activity was measured by activity assay. The result showed 
that recombinant lysozyme exhibited the same degree of thermal stability in the 
temperature range from 62 °C to 100 °C as human lysozyme. 

In another embodiment, approximately 50 pi of Hlys or rHIys was dissolved in 

25 PBS at 100 pg/ml and subjected to heat treatment. Four different temperatures of 65°C, 
72°C, 85°C and 100°C were tested. With each temperature, 0 min, 0.33 min, 1.5 min, 3 
min, 5 min and 15 min were selected to analyze the impact of incubation time on the 
stability of lysozyme (Fig 5A). 

For studies on pH stability, lysozyme was dissolved in 0.9% NaCI at 100 pg/ml at 

30 pH 10, 9, 7.4, 5, 4, and 2. The solutions were incubated at 24 C for one hour. 

Experiments were conducted with 200 pi per tube and repeated three times. Remaining 
lysozyme was detected by lysozyme activity assay. 

For pH treatments at pH 2, 4 and 5, Hlys and rHIys was dissolved in PBS 
adjusted to the corresponding pHs with HCI at 1 00 pg/mL For pH 9 and 1 0, lysozyme 

35 was dissolved in TBS and 150 mM sodium carbonate/bicarbonate at 100 pg/ml,- 
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respectively. Approximately 100 m' of lysozyme solution was incubated at 37°C for 30 
min. The lysozyme activity was assessed by activity assay (Fig 5B). 
Both Hlys and rHIys displayed similar thermal and pH stability. 
(Hi). Determination of in vitro protease resistance of Ivsozvme 
5 Lysozyme was dissolved in 0.9% NaCI at 100 pg/ml. The pH of the solution was 

reduced to 3, 4 and 5 with HCI. Pepsin (Sigma, MO) (pepsin: lysozyme = 1:22 (w/w)) 
was added and the solutions were incubated at 37°C for one hour. Then the pH of all 
treatments was raised to pH 7 with bicarbonate. Pancreatin (Sigma, MO) (pancreatin : 
lysozyme = 1:110 (w/w)) was added to the neutral solution and incubated at 37 C for two 
10 hours. The remaining lysozyme activity was measured by activity assay. 

In in vitro digestion experiments with pepsin and pancreatin, the native and 
recombinant human lysozyme displayed very similar resistance to pepsin and pancraetin 
digestion. Under these conditions, human albumin was degraded as demonstrated by 
SDS-PAGE (data not shown). 
15 (iv). Biochemical Characterization of Lysozyme 

After recombinant human lysozyme was purified to near homogeneity, several 
biochemical characterizations were carried out to compare human milk lysozyme with 
recombinant human milk lysozyme derived from rice cells. The results summarized in 
Table 3 show that by SDS-PAGE, native human milk lysozyme and recombinant 
20 lysozyme migrated to the same position. 

Nucleotides encoding the rice RamyZD signal peptide were attached to the 
human lysozyme gene in the expression vector pAPI156. Determination of the N- 
terminal amino acid sequence of the purified recombinant human lysozyme revealed an 
N-terminai sequence identical with that of native human lysozyme, as detailed above. 
25 Rice cells thus cleave the correct peptide bond to remove the RAmySD signal peptide, 
when it is attached in the human lysozyme precursor. 

The overall charge of recombinant and native human lysozyme were compared 
by isoelectric-focusing (IEF) gel electrophoresis and pi values determined. Since 
lysozyme is a basic protein with a calculated pi of 10.20, the pi comparison studies were 
30 carried out by reverse IEF gel electrophoresis. Recombinant and native human 

lysozyme displayed identical pi, indicating the same overall charge (data not shown). 

Recombinant human lysozyme derived from transgenic rice had a specific 
activity similar to the native lysozyme (200,000 units/mg (Sigma, MO), whereas, 
lysozyme from chicken egg whites had the expected 3-4 fold lower specific activity 
35 (Sigma, MO) (Fig. 4). 
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Table 3. Comparison of Biochemical Characteristics of Human Milk Lvsozvme and 
Recombinant Lvsozvme 



Lysozym 
e source 


N-terminal 
sequence 


Size 
(kDa) 


Giycosyla 
tion 


Specific 

activity 

(units/mg) 


pi 


Human 
milk 


KVFER C ELART 


14 


No 


201,526 


10.2 


rice 


KVFER(~)*ELART 


14 


No 


198,000 


10.2 



5 This cycle was not defined, and could be due to the un-modified cysteine 

residue which cannot form a stable derivative in Edman degradation analysis. 

The results described above demonstrate the ability to use rice cells as a 
production system to express human lysozyme from milk. Over 160 individual 
transformants were screened by immunoblot, enzymatic activity assay and ELISA. 

10 Yields of recombinant human milk lysozyme reached 4% of soluble cell proteins in 

culture cells and over 40% of soluble proteins in rice grains. Although the mechanism is 
not part of the invention, the high expression level may be explained by the utilization of 
the strong RAmy3D promoter (Huang et a/., 1993) in culture cell system and Gt1 
promoter in grain expression system and the codon-optimized gene. 

1 5 The plant derived human milk lysozyme obtained by the methods of the present 

invention was identical to endogenous human lysozyme in electrophoretic mobility, 
molecular weight, overall surface charges and specific bactericidal activity, 

EXAMPLE 4 

20 Characterization of Recombinant Human Lactoferrin (rLF) Produced by Transgenic Rice 

Plants 

A. Southern blot analysis 

About three grams of young leaf were collected and ground with liquid nitrogen 
into a very fine powder. The DNA was isolated according to the procedure as described 
25 in Dellaporta et aL, 1983, and purified by phenol-chloroform extraction. Approximately 5 
fig of ECoRI and H/ndlll digested DNA from each line was used to make blot for 
Southern analysis. The ECLTM direct nucleic acid labeling and detection system 
(Amersham, USA) was used for analysis. 

The lactoferrin gene copy number was estimated to be from about 1 to about 10 
30 as determined by Southern blot hybridization using EcoRi and H/ndlli digested genomic 
DNA. The API1 64-1 2-1 (R0) transgenic plant line was subjected to Southern analysis 
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together with ten Western blot positive, field grown R1 lines. A typical Southern blot 
shows that there are at least three fragments above the original plasmid derived plant 
transformation unit (3156 bp). All the LF inserts appear to be inherited from the original 
R0 transgenic plant event to R5 generation. 

5 

B. Protein Isolation and Western blot 

Rice seeds were ground with 1 ml of 0.35 N NaCI in phosphate buffer saline 
(PBS), pH 7.4 using an ice-cold mortar and pestle and the resulting homogenate was 
centrifuged at 15000 rpm for 15 min at 4 C C. The supernatant was used as a protein 
10 extract and about 1/25 or 1/50 of the salt soluble content was loaded onto a 10% pre 
cast gel (Novex, USA) and electrophoresis was carried according to the manufacturer's 
instructions. For total protein detection, the polyacrylamide gel was stained with 0.1% 
Coomassie brilliant blue R-250 (dissolved in 45% methanol and 10% glacial acetic acid) 
for at least three hours and destained with 45% methanol and 10% glacial acetic acid 
15 until the desired background was achieved. 

For Western blot analysis, SDS-PAGE gels were electroblotted onto a 0.45 pm 
nitrocellulose membrane with a Mini-Trans-Blot Electrophoretic Transfer Cell System 
(Bio-Rad, USA) and subsequently subjected to immuno-blotting analysis. The blot was 
blocked with 5% non-fat dry milk in PBS for at least two hours followed by three washes 
20 with PBS for 10 minutes each. The primary rabbit polyclonal antibody against hLF 
(Daka A/S, Denmark) was diluted at 1 :2500 in the blocking buffer and the blot was 
incubated in the solution for one hour. The blot was washed with PBS for three times 
with 10 minutes each. The secondary goat anti-rabbit IgG (H+L)-alkaline phosphatase 
conjugated (Bio-Rad, USA) was diluted in the blocking buffer at 1:5000 ratio. The 
25 membrane was incubated in the secondary antibody solution for one hour and followed 
by three washes with PBS. Color development was initiated by adding the substrate 
system BCIP-NBT (Sigma, USA) and the process was stopped by rinsing the blot with 
H20 once the desirable intensity of the bands was achieved. 

One hundred eight (108) R0 plants were grown to maturity, seeds were 
30 harvested from 56 fertile plants and individual seeds analyzed by Western blot to detect 
the expression of rLF. Coomassie blue staining was carried out to compare the mobility 
of rLF with native human lactoferrin (hLF) (Fig. 8), with 40 pg of total protein loaded onto 
each lane, along with 40 ng of native purified hLF per lane as the positive control. 

Estimation of total rLF by ELISA indicated that from 93 pg to 130 pg rLF was 
35 expressed in transformed rice seeds. A typical Western blot analysis (Fig. 9) illustrates 
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that both rLF and native hLF migrate at approximately the same rate with the molecular 
weight about 80 kDa, consistent with that determined by other researchers (Wang et a/., 
1984). 

5 C. Protein purification 

Rice seeds from R2 homozygous generation were dehusked and milled to flour 
conventionally. Recombinant lactoferrin was extracted by mixing the rice flour with 0.35 
N NaCI in PBS at 1 00 g/l at room temperature for two hours. The resulting mixture was 
centrifuged at 15,000 rpm for one hour at 4°C. The collected supernatant was subjected 

10 to the following steps of filtration before loading onto a Sepharose column. First, the 
supernatant was run through a few layers of cheesecloth. Then the filtrate was passed 
sequentially through an 8Mm paper, 1 pm paper and a 0.25pm nitrocellulose membrane. 
The clear protein solution was loaded onto a ConA Sepharose column (Pharmacia, XK 
26) which had been equilibrated with 0.5 N NaCI in 20 mM Tris, pH 7.4 (binding buffer) 

15 at a flow rate at 4 ml/min. After the loading was complete, the column was washed with 
binding buffer until the baseline at A280nm was achieved. Lactoferrin was eluted with 
0.1N mannoside in the binding buffer. Fractions containing lactoferrin were pooled and 
loaded onto a second column SP-Sepharose (Bio-Rad, USA) which has been 
equilibrated with 0.4 N NaCI in 50 mM sodium phosphate, pH 8.0 (binding buffer B) at 

20 the flow rate 4 ml/min. Then the column was washed with the binding buffer B until the 
baseline at A280 nm was obtained. Lactoferrin was eluted by 1 N NaCI in 50 mM 
sodium phosphate, pH 8.0 and the fractions containing LF were pooled and diatyzed 
against PBS. Finally the purity of LF was assessed by SDS-PAGE and stored at - 80°C. 
In another embodiment, recombinant human lactoferrin (rHLF) was extracted by 

25 mixing rice flour with 0.35 M NaCI in PBS at 75 g/L at room temperature for 2.5 hours. 
The extract was passed through six layers of cheesecloth before centrifugation (10,000 
g for 1 hour at 4°C). The supernatant was recovered and the NaCI concentration was 
adjusted to 0.4 M (pH 8.0). After a second centrifugation at 10,000 g for 10 minutes at 
4°C, the supernatant was collected and filtered through 0.45 pm nitrocellulose 

30 membrane. The filtrate was loaded onto a SP-Sepharose column (Bio-Rad, Hercules, 
CA) which had been equilibrated with 0.4 M NaCI in 50 mM sodium phosphate, pH 8.0 
(binding buffer) at a flow rate of 4 ml/min. The column was washed with the binding 
buffer until baseline A280 was obtained. Lactoferrin was eluted by a linear gradient and 
dialyzed against PBS. The purified rHLF was analyzed by SDS-PAGE and stored at - 

35 80 e C. 
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D. Enzyme Linked Immunosorbant Assay (ELISA) 

ELISA was conducted using seed extracts, isolated as described above, with 
total protein assayed using the Bradford method (Bradford, M., 1976). The ELISA was 
based on a typical sandwich format generally known in the art. Briefly, 96 well plates 
5 were coated with rabbit anti-human iactoferrin antibody (Daka A/S, Denmark), then rLF 
and control samples were added to individual wells of the plate and incubated for 1 hour 
at 35°C. Rabbit anti-human Iactoferrin horseradish peroxidase conjugate (Biodesign, 
USA) was then added to each well and incubated for 1 hour at 35°C, followed by 
addition of the tetramethylbenzidine substrate (Sigma, USA) and incubation for 3 
10 minutes at room temperature. The reaction was stopped by adding 1N H2S04 to each 
well. The plates were read at dual wavelengths of 450 and 650 nm in a Microplate 
Reader (Bio-Rad, model 3550) and the data was processed by using Microplate 
Manager HI (Bio-Rad). The results of an analysis of 10 homozygous selected lines 
showed that from 93 pg to 130 M9 rLF was expressed per seed. 

15 

E. Selection of plants for advance generations 

At least 20 - 40 seeds from 1 1 independent lines were analyzed. Individual R1 
seeds were cut into half and endospermic halves were subjected to analysis by Western 
blot with the positive corresponding embryonic halves germinated on 3% sucrose 

20 medium with 0.7% agar. The seedlings were transplanted to the field for R1 generation. 
Out of 1 1 individual lines, 3 lines were expressed. A total of 38 plants were grown in the 
field derived from the 3 expressed mother lines. Based on the agronomic character 
(Table 4) of those 38 plants, 28 plants were selected. 

It was observed that all the Western positive R1 seeds were opaque to pinkish in 

25 color in comparison to control seeds, so this criterion was applied in screening the R2 
seeds. Mature R2 seeds were harvested at maturity and dehusked. The pinkish R2 
seeds were confirmed by Western dot blot and ELISA as expressing rLF (data not 
presented). Finally 10 homozygous R2 lines were selected and grown in the field in 
order to advance the generation. 
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Table 4. Comparison Of Phenotypic Characteristics Of Native TP-309 And Transformed 
TP-309 Rice Seeds 



Source 


Effective 
tiller 


Blank 

grain (%) 


1000 seed 
weight (g) 


ug of rLF/seed 


TP-309 


43 


5.0 


25 




Homozygous 
transgenic lines 


42 


19.7 


20.2 


125 



5 During R2 and R3 generation the percentage of blank seeds was higher in 

homozygous transgenic lines than in the non-transgenic control. This affected the 1000 
seed weight. However, in the R4 generation no significant differences in phenotypic 
character were observed in homozygous transgenic lines when compared to non- 
transformed TP309 (Table 4). 

10 

F. Attributes Of Recombinant Human Lactoferrin Produced In Rice 

Physical characterization of the rLF showed there was no significant difference 

between the rLF and a commercially available purified form of hLF based on N-terminal 

amino acid sequencing, and physical characteristics of rLF such as molecular weight as 
15 determined by MALDI-MS, HPLC profile of which showed a comparable peptide map, 

pH dependent iron release and bacteriostatic activity, using the analyses described 

below. 

(ii N-Terminal Amino Acid Sequencing 

Purified rLF from rice seeds was resolved by 10% SDS-PAGE, followed by 
20 electroblotting to PVDF membrane (Bio-Rad, USA). The target band was identified by 
staining the membrane with 0.1% Coomassie brilliant blue R-250 in 40% methanol and 
1% glacial acetic acid for 1 minute. The stained PVDF membrane was immediately 
destained in 50% methanol until the band is clearly visible. The blot was thoroughly 
washed with ddH20 and air dried. Finally this sample was sent to the Protein Structure 
25 Laboratory in University of California at Davis (CA, USA) for sequencing analysis, 
(ii). Detection of qlvcosvlation and determination of sugar content 
Glycosylation of the recombinant human lactoferrin produced in rice was 
analyzed by an immunoblot kit for glycoprotein detection (Bio-Rad, USA) per instructions 
from the manufacturer. An increase of molecular weight of lactoferrin due to 
30 carbohydrate content was determined by Matrix Assisted Laser Desorption lonization- 
Mass spectrometry (MALDI-MS) (PE Applied Biosystems, Voyager System). 
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Recombinant lactoferrin produced in rice is glycosylated as evident from the 
binding to Con A resin, the positive staining by glycoprotein detection kit as well as the 
larger detected mass as compared to the calculated mass (76.2 kDa) based on the 
peptide backbone. MALDI-MS showed that seed derived recombinant lactoferrin has 
5 molecular weight of 78.5 kD while human milk lactoferrin is 80.6 kDa (Table 5). The 
difference could be due to the lesser degree of glycosylation in the rice seed-derived 
lactoferrin. Analysis shows that the purified rHLF contains xylose but lacks sialic acid, 
which is consistent with plant post-translational modification patterns (Matsumoto et a/., 
1995). 

10 (iii). Determination of isoelectric point of Lactoferrin 

Reverse isoelectric focusing (IEF) gel electrophoresis was carried out with a 
precast Novex IEF gel, pH 3-10 according to the manufacturer* instruction. About 30 \jg 
of purified rLF was loaded and the running condition was 100 V for 50 minutes and 200 
V for 20 minutes. The gel was then fixed in 136 mM sulphosalicylic acid and 1 1 .5% TCA 

15 for 30 minutes, stained in 0.1% Coomassie brilliant blue R-250, 40% ethanol, 10% 

glacial acetic acid for 30 minutes and destained in a solution containing 25% ethanol and 
8% acetic acid. 

(iv). Comparison of physical characteristics of rLF with native hLF 

The HPLC profile of native and rLF showed a comparable peptide map. This 

20 confirmed that LF from the two sources have an identical amino acid sequence (data not 
presented). Additional comparisons confirm that human lactoferrrin produced in 
transgenic rice closely resembles native human lactoferrin, as evidenced by (1) the N- 
terminal sequence of purified rLF from homozygous R2 seeds and hLF (Dakao A/S, 
Denmark), which were shown to be identical (Table 5); (2) the isoelectric point (pi) of 

25 native and rice seed derived LF which is the same, indicating that they have similar 

surface charges (Table 5); (3) the pH dependent iron release of rLF which was shown to 
be closely related to that of native hLF (Fig. 1 1 and see section vii of example 4); and (4) 
the bacteriostatic activity of rHLf which was shown to be similar to that of native human 
lactoferrin (nHLf) on enteropathogenic E. coli (EPEC; Fig. 10) and confirmed the 

30 presence of active recombinant LF in extracts derived from transformed rice seeds (see 
section ix of Example 4). 
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Table 5. Physical characterization data for human (hLFVand rice seed derived 
recombinant lactoferrin (rLF) 



LF 

source 


Size 
(kDa) 


N-terminal sequence 


Pi 


Glycosy- 
lated 


Sugar 
content (%) 


hLF 


80.6 


GlyArgArgArgArgSerValGlnTrpCysAla 


8.2 


YES 


5.5 


rLF 


78.5 


GlyArgArgArgArgSerValGlnTrp( )Ala 


8.2 


YES 


2.9 


(V). 


Iron content and nutrient value determination of rice seeds 





The iron content of R2 homozygous seeds was determined. Two grams of dry mature 
seeds from each transformed and non-transformed line were weighed and wet-ashed with 
HN03 and H202 solution at 1 1CTC (Goto et a/., 1999). The ash was dissolved in 1N HCI 
solution. The iron content was then measured by absorbance of Fe-O-phenanthroIin at 510 
1 0 nm, using a Sigma kit (Sigma, USA) per instructions of manufacturer. 

The different values of nutrient facts of homozygous transgenic seeds and non 
transgenic seeds were measured by standard procedure at A & L Western Agricultural 
Laboratories (Modesto, CA, USA). 

A comparative analysis of transgenic lactoferrrin-expressing rice seeds with non 
1 5 transformed native Teipei-309 showed that there is no significant difference between 
transformed and non transformed seeds in nutrient value with the exception that the 
concentration of iron is 50% greater (Table 6). The increased level of iron may be the reason 
for the opaqueness and pink coloration of the rLF expressing transgenic rice seeds. 

In another embodiment, 0.2 grams of dried, dehusked grains expressing rHLF were 
20 wet-ashed with concentrated HN03 for two days and dissolved in 5 ml of DDI H20. The iron 
contents of the samples were measured by flame atomic absorption spectrophotometry 
(Thermo Jarrel Ash SH4000, Franklin, MA). NIST liver was analyzed concurrently to verify the 
accuracy of the standard curve. 

The iron content of transgenic rice grains was more than twice that of non-transformed 
25 TP309 grains, while there were no significant differences in other tested nutrition factors 
between transformed and non-transformed grains (Table 7). This suggests that groups 
ingesting transgenic rice with rHLF will increase the iron intake, 

The transgenic grains with increased iron content were opaque-pinkish in color. The 
opaque-pinkish color was observed inside as well as outside the rice endosperm. This 
30 opaque-pinkish color, segregated in Mendelian fashion, was linked with expression of rHLF 
and was inherited through the R4 generation. 
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There was no difference noticed during the seed germination of transgenic seeds, the 
phenotype of R2 R3 and R4 plants was vigorous and the seed yield was similar to that of non- 
transgenic Teipei-309 plants (data not shown). 

5 Table 6. Comparison of Nutrition Value (in mq) Per 100 Gram of Non Transformed and 
Transformed Rice Seeds 



Source 


Carbohy- 
drate 


Protein 


Fat 


Ca 


K 


Na 


Fe 


Water 


Calories 


TP-309 


76.0 


8.7 


2.4 


9 


370 


<10 


0.8 


11.3 


369 


Homozygous 
transformed 
I lines 


75.7 


8.7 


2.2 


8 


330 


<10 


1.2 


11.8 


367 



Table 7. Comparison of Mineral Contents (in uq) Per Gram of rHLF-Transformed and 
10 Non-Transformed Rice Grains 



Source 


Cu 


Fe 


Mn 




Non-transformed 


2.9 


8.7 


33.1 


20.8 | 


Transformed 


4.7 


19.2 


17.7 


28.7 | 



(vi) . Tissue specificity and stability of rLF 

An endosperm specific rice glutelin promoter was used to express recombinant 
15 lactoferrin in maturing or matured seeds. To confirm the tissue specificity of the 

expressed lactoferrin, protein was extracted from root, shoot, leaf beside mature seed 
and subjected to Western blot and the results indicated that there was no detectable 
expression of rLF except in the seed/endosperm (Fig. 9). Furthermore, the presence of 
rLF in 5 day old germinated seeds showed the stability of stored rLF within the plant cell 
20 during germination. 

(vii) . Iron Saturation And pH Dependent Iron Release 

Lactoferrin was incubated with 2M excess ferric iron (FeCI3 :NTA = 1:4) and 
sodium bicarbonate (Fe:HC03- = 1 :1 ) for 2h at room temperature. Excess free iron 
was removed by using a PD-10 desalting column (Pharmacia, USA) and the iron 
25 saturation level was determined by the A280/A456 ratio. Both native hLF and rLF were 
completely saturated by iron. Holo hLF was incubated in buffers with a pH between 2 
and 7.4, at room temperature for 24 h. Free iron released from hLF was removed and 
the iron saturation level was determined by A280/A456 ratio. 
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The results showed that iron release was similar for both hLF and rLF. Iron 
release began around pH 4 and was completed around pH 2 (Fig. 11), The iron binding 
was reversible since iron-desaturated rLF was re-saturated by raising the pH to 7 (data 
not shown). The similarity in pH dependent iron release of rLF to that of the hLF 
5 standard demonstrated that rLF is able to adapt the appropriate tertiary structure for 
proper iron binding and release (Salmon, Legrand ef a/. 1997). 

(viii). Binding and uptake by Caco-2 cells 

50,000 Caco-2 cells/well were seeded and grown in Minimum Essential Medium 
(GIBCO, Rockvilie, MD) containing 10% fetal bovine serum in 24 or 48 well tissue 

1 0 culture plates for 3 weeks. For binding studies, Caco-2 cells were incubated with 
varying concentrations (0-2 pM) of 1251-HLf in the presence or absence of 100-fold 
excess of unlabeled nHLf for 2 hours at 4 oC and cells were washed 5 times with ice- 
cold PBS. Cells were solubilized with 0.5 ml of 0.1% SDS and radioactivity was 
quantified in a gamma counter. For uptake studies, 0.4 pM of 1251-HLf was incubated 

15 with Caco-2 cells for 0 to 24 hours at 37 °C and cells were washed, dissociated by the 
same way as in the binding study. 0.5 ml of 24% TCA solution was added to the 
dissociated cells and free iodine was removed by the centrifugation. Free and protein- 
bound 125 l were quantified separately to evaluate how much of HLf was degraded in the 
cells. Receptor-binding of rHLf to the human intestinal Caco-2 cell line was saturable 

20 and specific, indicating that rHLf bound to the Lf receptor. The binding constant was 
similar for rHLf and nHLf, but the number of binding sites was slightly higher for rHLf, 
which may be due to the difference in glycosylation. Uptake of HLf by Caco-2 cells was 
identical for rHLf and nHLf. 

(be). In vitro Digestion: effect on antimicrobial activity and bindinq/uptake to 

25 Caco-2 cells 

Lactoferrin is known to inhibit the growth of a variety of bacterial species based 
on its iron chelation and direct bactericidal properties. The anti-microbial effect of rLF 
extracted from rice seeds was tested following treatment using an in vitro digestion 
model with an enzymatic system containing pepsin (an enzyme active in stomach) and 

30 pancreatin (an enzyme active in deodenum). 

LF proteins were dissolved in PBS at 1mg/ml, and either left untreated, pepsin 
treated (0.08mg/ml at 37°C for 30 min), or pepsin/pancreatin treated (0.016 mg/ml at 
37°C for 30 min). LF proteins were sterilized by passing through a membrane filter with 
a pore size of 0.2 jim [Rudloff, 1992]. The filter sterilized LF (O.S^ig/ml) was incubated 

35 with 104 colony forming unit (CFU) enteropathogenic E. coli (EPEC)/^I in 100 jal sterile 
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synthetic broth (1.7% ; AOAC) containing 0.1% dextrose and 0.4 ppm ferrous sulfate at 
37°C for 12h and colony forming units (CFU) were determined. 

Starting with an enteropathogenic E. coli (EPEC) concentration of 10 4 CFU 
(colony forming units), the untreated samples of rLF reached up to 10 6 5 CFU after 12 h 
5 of incubation at 37°C in comparison to hLF, which produced up to 10 6 CFU. An in vitro 
digestion model using an enzymatic system containing pepsin (enzyme active in 
stomach) and pancreatin (enzyme active in deodenum) with moderate shaking to 
imitate the transit of protein through infant gut [Rudloff, 1992] was used. rLf and nHLf 
were treated with active pepsin and pancreatic enzymes and exposed to 10 4 CFU 
10 EPEC cells for 12 h at 37°C (Fig. 10). Both the native human lactoferrin standard 
(nHLf) and the recombinant rice-derived lactoferrin (rLf) remained active in inhibiting 
growth of enteropathogenic E. coli, indicating that both nHLf and rHLf are resistant to 
protease digestion. 

SDS-PAGE and ELISA revealed that nHLf and rHLf resist digestion by pepsin 

15 (at pH 3.8) and pancreatin, whereas human serum albumin is completely digested after 
in vitro digestion. Western blots revealed that immunoreactivity was also maintained 
after digestion. Although some smaller molecules were generated during 

digestion of HLf, most of the immunologically detectable HLf retained its intact size. 
More than 50 % of rHLf and nHLf was immunologically detectable by ELISA, but 125 l- ; 

20 HLf was around 40 % and 59 Fe-HLf was only 20 % detectable, indicating that ELISA 
detects small peptide fragments of HLf, which are removed by the PD-10 column and 
that about 50-60 % of Fe was released from detectable HLf after in vitro digestion. The 
iron-holding capacity was not significantly different. 

The dissociation constant (Kd) and the number of binding sites for HLf to its 

25 receptor were determined from the binding study. Both Kd and the number of bindings 
sites were not significantly different between nHLf and rHLf after in vitro digestion 
(Figure 12A, 12B). Digestion did not appear to affect on the Kd but made the number of 
binding sites much lower. Total Lf uptake was not significantly different between nHLf 
and rHLf after in vitro digestion (Figure 12C), though uptake was about one third when 

30 compared with undigested nHLf. Total iron uptake from nHLf was twice as high as that 
from rHLf. Percent degradation of HLf was similar regardless of digestion or not, and 
the native or recombinant form (Figure 12D). 

(x).' Thermal Stability: effect on antimicrobial activity and binding/uptake to 
Caco-2 cells 
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1.0 mg/ml of holo-HLf in PBS was treated by the following conditions: (a) 62 °C 
for 15 minutes, (b) 72 °C for 20 seconds, (c) 85 °C for 3 minutes, or (d) 100 oC for 8 
seconds. Survival ratio of HLf determined by ELISA were more than 90% following 
treatment at 62 °C for 15 minutes, at 72 °C for 20 seconds, or at 85 °C for 3 minutes, 
5 but it was considerably lower after 100 °C for 8 seconds. This high temperature 
precipitated both types of HLf and only 10% of HLf was detectable by ELISA. More 
than 80% of iron was still bound to both rHLf and nHLf after all thermal treatments with 
the exception of 100 °C for 8 sec. In 1 0% of survived HLf after 100 °C for 8 sec, the 
iron saturation level of nHLf was above 80% whereas that of rHLf was only about 40%. 

10 SDS-PAGE and Western blots revealed no difference in immunoreactivity 

between nHLf and rHLf at 62 °C for 1 5 minutes, at 72 °C for 20 seconds, and at 85 °C 
for 3 minutes, but at 100 °C for 8 seconds, rHLf almost completely lost its 
immunological activity, whereas nHLf still maintained detectable immunoreactivity. 

There was no significant difference in anti-microbial activity between nHLf and 

15 rHLf after heat-treatment. Anti-microbial activity of HLf was not affected by treatment at 
either 62 °C for 1 5 min, 72 °C for 20 sec or 85 °C for 3 min. 

The Kd and the number of binding site for nHLf and rHLf were not significantly 
different at 62 °C and 72 °C though there is a trend that nHLf is somewhat lower Kd and 
binding sites than rHLf. As the temperature was increased (such as 85 °C and 100 °C), 

20 more rHLf bound to Caco-2 cells, most likely by non-specific binding due to more rHLf 
being denatured than nHLf. Uptake properties were similar for nHLf and rHLf even in 
the group treated at 100 °C where uptake of both types of HLf was highest among all 
the thermal treatments. Free iodine levels in the cells were also evaluated since it 
reflects degradation of HLf. About 20% of HLf was degraded in the untreated sample. 

25 There was no significant difference between nHLf and rHLf. Interestingly, samples 
treated at 100 °C were degraded twice as much as untreated samples of nHLf and 
rHLf, which may indicate that denaturation of HLf caused by heat treatment will make 
the protein more susceptible to proteases in the cells. 

(xi). pH Stability: effect on antimicrobial activity and binding/uptake to Caco-2 

30 cells 

1 .0 mg/ml of holo-HLf in PBS was adjusted to pH 2, 4, 6, or 7.4 by the addition 
of 1 M HCI and incubated for 1 h at room temperature. The pH was then adjusted to 
7.0 with 1 M NaHCG3. Free iron released from HLf, was removed by a desalting 
column. 
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After low pH treatment, 100% of both nHLf and rHLf survived. The iron-holding 
capacity was maintained in all samples and the iron saturation level was above 95%. 
SDS-PAGE and Western blots revealed that there was no difference between nHLf and 
rHLf for any of the treatments. A slightly smaller immunoreactive molecule (-70 kD) 
5 was detected after exposure of nHLf to pH 2 and 4 and of rHLf to pH 2. 

Antimicrobial activities of nHLf and rHLf were stable after exposure to low pH in 
the range of pH 2.0 to 7,4. As the pH was lowered, the activity of rHLf appeared to be 
higher and constant, whereas nHLf did not show any pH dependency. 

Kd and the number of binding sites for nHLf were not significantly different from 
10 those for rHLf but a trend was always lower for nHLf within the range of pH 2.0 to 7.4, 
which is similar to control and thermal treatment samples. The Kd and the number of 
binding sites for nHLf and rHLf were unaffected by pH treatment down to 2.0 for 1 hour. 
Uptake properties were similar for nHLf and rHLf in the pH range of 2.0 to 7.4. 
Degradation of HLf in Caco-2 cells was also evaluated and there was no significant 
1 5 difference between nHLf and rHLf. 

EXAMPLE 5 

Generation and Characterization of Recombinant Human a- 1 -antitrypsin (AAT) 
Produced by Transgenic Rice Plants 

20 A. Construction and expression of human AAT in rice cells 

The construction and purification of functional recombinant human AAT were 
carried out as exemplified in previous examples. Briefly, codon-optimized AAT gene 
was cloned into an pAPI145 that contains the rice Gt1 promoter, Gt1 signal peptide, 
and Nos terminator, pAPI241 that contains Gib promoter, Gib signal peptide, and Nos 

25 terminator, and API280 that contains Bx7 promoter, Bx7 signal peptide, and Nos 

terminator, as exemplified in Example 1. The resulting plasmids were named pAPI250, 
API255 and pAPI282, respectively (Figure 13). Transgenic plants expressing AAT were 
generated as above, and plant-generated recombinant AAT was characterized. To 
express AAT in culture cells, codon-optimized AAT gene was cloned into an expression 

30 cassette that contains the rice RAmy3D promoter, signal peptide, and terminator. 

Recombinant AAT expression was induced and secreted to the culture medium under 
the sugar starvation condition. Purification of rAAT was achieved through a scheme 
that consisted of an affinity column (Con A), anion exchange column (DEAE), and a 
hydrophobic interaction column (Octyl). 
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B. SDS- Polvacrvlamide Gel Electrophoresis (SPS-PAGE) 

AAT samples were ground with PBS with mortar and pestle. The resulting 
extract was spun and 20 microliters of supernatant loaded into a precast SDS-PAGE 
gel. The AAT protein was clearly visualized with Coomassie brilliant blue staining 
5 (Figure 14). 

C. Western Blot Analysis 

For immunoblotting analysis, gels were electroblotted to a 0.45 pm nitrocellulose 
membrane with a Mini Trans-blot Electrophoretic Transfer cell (Bio-Rad, USA) and 

10 subsequently subjected to immunoblotting analysis. Blots were blocked with 5% non-fat 
dry milk in PBS, pH 7.4 for at least two hours followed by three washes with PBS, pH 
7.4 for 10 minutes each. The primary rabbit polyclonal antibody against human alpha-1- 
antitrypsin (Dako A/S, Denmark) was diluted to 1:2500 in the blocking buffer and the blot 
was incubated for at least one hour. The blot was then washed as described previously. 

15 The secondary antibody, goat anti-rabbit IgG (H+L)-alkaIine phosphatase conjugated 
(Bio-Rad), was diluted in the blocking buffer at a dilution of 1:4000. The membrane was 
then incubated in the secondary antibody solution for one hour and followed by the 
same wash process. Color development was initiated by adding the substrate 
BCIP/NBT from Sigma. 

20 The western result showed that AAT protein is clearly visualized and confirmed 

that AAT expressed and deposited in transgenic rice grain, has a molecular weight that 
is somewhat smaller than that of native AAT (Figure 15). 

D. ELISA 

25 Standards for this assay ranged from 1 .25 - 20 ng/mL of AAT (Athena) diluted in 

PBST. Nunc Immuno-plate Maxisorp 96-well plates (Nunc, Denmark) were coated for 
16 h at 4°C by a 1:10,000 dilution of rabbit anti-human AAT in 0.05 M sodium 
bicarbonate, pH 9.6. The plates were washed 3 times with PBST (PBS, pH=7.4, 0.05% 
Tween-20) and subsequently incubated with sample for 1 h at room temperature while 

30 rocking. The plates were washed again 3 times with PBST, followed by incubation with 
a 1:50,000 dilution of goat anti-human AAT conjugated to HRP for 1 h at room 
temperature. The plates were washed 3 times with PBST, and bound antibody was 
detected by the HRP/ hydrogen peroxide catalyzed reaction of TMB. The reaction was 
stopped with 2 M sulfuric acid, and the plates were read on a microtiter plate reader at 

35 450 nm, using 620 nm as a reference filter. 
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Recombinant AAT is 2.1 times more immunoreactive, when comparing equal 
concentrations as determined by the Lowry assay. 

E. AAT activity assay 

5 AAT activity was analyzed using a modified method published by Travis and 

Johnson (1981). In 96-well microtiter plates, 60 jiL samples diluted in Tris buffer, (0,2 M 
Tris, pH 8.0) were added. In each well, 60 \iL of elastase [0.01 mg/mL porcine 
pancreatic elastase (PPE) in Tris buffer] was also added. The plate was rocked for 5 
min at room temperature to allow any available AAT to bind to the elastase. Another 

10 120 nL of substrate solution (10 M N-Succinyl-AAA-p-nitroanilide in DMSO diluted in 
Tris buffer to give 0.33 M N-Succinyl-AAA-p-nitroanilide) was added, and the plate was 
rocked for 1-2 min at room temperature. The plate was immediately read on a microtiter 
plate at 405 nm. The plate was read again after 5 min, and the change in absorbance 
was calculated. AAT activity was determined using linear regression from a standard 

15 curve. The results show that AAT protein produced in rice grain has similar bioactivity 
as that of native AAT. 

F. Band shift assay 

The unique property of the covalent-linked complex formed between AAT and 
20 PPE permits an analysis of the activity of AAT by SDS-PAGE. Briefly, 20 Ml of tested 
samples containing AAT from the screening or purification processes was incubated 
with 100 ng PPE at 37°C for 15 minutes. Five pi of SDS-foading dye was added and the 
reaction mixture boiled for five minutes. The sample was then centrifuged and kept on 
ice until loaded onto a 10% precast SDS-PAGE gel. The resulting gel was stained with 
25 0.1% Goomassie brilliant blue R-250 as described below. For immunodetection, a 
western blot analysis was carried as described above. Again the band shift assay 
indicated that AAT protein produced in rice grain has similar bioactivity as that of native 
AAT (Fig. 16A and 16B). 

30 G. In vitro digestion . 

The digestion was carried out using a modified method of Rudloff and Lonnerdal 
(1992) was used after some modifications. Native and recombinant AAT were diluted in 
PBS or formula to 0.5 mg/mL. Hydrochloric acid (1 M) was added to all samples to 
adjust the pH 3, 4, and 5, then 2.5 \xL of 2% pepsin in 0.01 M HCI (3,100 U/mg solid) 

35 were added and all samples were placed in a shaking incubator for 30 or 60 min at 
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37°C. The pH was restored by drop-wise addition of 1 M NaHC03, and 2.5 of 0.4% 
pancreatin in 0.1 M NaHC03 were added. Samples were incubated for 1 or 2 hours at 
37°C, and the reaction was halted by dilution 1 :2 in sample buffer and boiling for 3 min. 
For samples subjected to pepsin digestion only, boiling was unnecessary since the 
5 pepsin was inactivated when the pH was raised above pH 6 with NaHC03 (Piper and 
Fenton, 1965). The enzyme: substrate ratio was approximately 1:20 for samples in 
buffer only and about 1:600 for samples in formula. 

A significant amount of recombinant and native AAT survived the in vitro 
digestion, and both forms were more resistant to degradation than human serum 

10 albumin. Digestion with pepsin at pH 4 shows that 65% of recombinant AAT is 

detectable by ELtSA after digestion, which is similar to 67% of native AAT surviving. 
The trypsin assay shows that much of the inhibitory properties of both forms are still 
intact, and the activity assay reveals that 63% and 59% of the activity of native and 
recombinant AAT remains, respectively. When exposed to both pepsin and pancreatin 

15 in buffer, native AAT resisted degradation when the pH of the pepsin incubation was pH 
4 or higher. Under this condition, the recombinant form was less resistant, although a 
large part remained after pepsin digestion at pH 5 and pancreatin digestion. At pH 4, 
more of the recombinant protein was degraded, either due to pepsin activity or pH 
instability. AAT activity could not be determined after digestion by pepsin and pancreatin 

20 because of the inactivation of pancreatin by boiling which also inactivates AAT activity. 
In formula, both forms appeared to be equally resistant to degradation. While both 
native and recombinant AAT were still present after pepsin digestion at pH 5 followed by 
pancreatin digestion, bands at about 33 kD (casein) are faint or missing. It is possible 
that other proteins in formula are preferentially cleaved, reducing the amount of AAT 

25 being digested. 

H. Thermal Stability of recombinant human ATT 

Both native and recombinant human AAT were diluted in phosphate buffered 
saline (PBS) or infant formula (Enfamil with Iron, Mead Johnson, Evansville, IL) to a 

30 concentration of 0.1 mg/mL Samples, 100 jjL in capped, 10 x 75 mm glass tubes, were 
treated as follows: 60°C for 15 min, 72°C for 20 sec, 85°C for 3 min, and 137°C 
(temperature of oil bath) for 20 sec. The samples were allowed to cool to room 
temperature after heat treatment. For formula samples with bile extract added, 2.5 juL of 
12% porcine bile extract (Sigma) were added, then vortexed quickly, incubated at 37°C 

35 for 10 min, and vortexed again. All samples were diluted 1:10 in PBS and transferred to 

78 



BNSDOCID: <WO 



02064750A2 I > 



WO 02/064750 



PCT/US02/04909 



1 ,5 mL tubes. Formula samples were centrifuged at 1 5 f O00g for 20 min to remove the 
insoluble fraction, and the supernatant was withdrawn after skimming off the fat All 
samples were subsequently transferred to 1 .5 mL tubes and analyzed. 

The thermal stability of native AAT exceeded that of the recombinant form in 
5 buffer, but the recombinant AAT retained significant stability under most conditions. 
When heated in buffer only, SDS-PAGE and Western blots show that the two forms of 
AAT have similar structural stability. While the ELISA data show that the recombinant 
protein is less stable at the higher temperatures, the recombinant protein is similar to the 
native form under the other conditions. However, the functional stability of the 

1 0 recombinant protein may be affected. The thermal stability assay shows that the 

recombinant protein lost functional ability at several of the heat conditions, whereas the 
native protein was functional at all heat conditions except for at 62°C for 1 5 minutes. 
While the elastase-inhibiting properties of native AAT were about 90% after all heat 
treatments, 62 and 51% of the recombinant protein's activity remained after 85°C, 3 

15 minutes, and 137°C, 20 seconds, respectively. 

The heat treatments of native and recombinant AAT in formula affected the 
detection of the proteins, but the addition of bile extract following heat treatment 
restored antibody recognition of the recombinant form. While the Western blot data 
show less detectable protein only at 85°C, 3 min for the native AAT and at 72°C, 20 sec 

20 and 1 37°C, 20 sec for the recombinant AAT, the ELISA data shows less than 20% 
protein detected for both forms and for all heat conditions. When bile extracts were 
added to the heated formula samples, the ELISA data for the recombinant form showed 
that more than 50% was still detectable after heat treatment. The bite extract did affect 
detection of the native form by ELISA for most of the heat treatments. The Western 

25 blots corroborated the ELISA data and showed that the bile extract may dissociate the 
recombinant AAT from other formula proteins, but it is not effective for native AAT at the 
higher temperatures. 

I. pH stability of human ATT 

30 Native and recombinant AAT were diluted in PBS or formula to 0.1 mg/mL. The 

sample volume was 1 mL, and the pH of each sample was adjusted drop-wise with 1 M 
HCI. The range of pHs tested was from pH 2 to 8 for the samples in PBS and pH 2 to 7 
for samples in formula. After a 1 hour incubation at room temperature, the pH was 
restored to pH 7 with 1M NaHC03. Formula samples were centrifuged as exemplified 

35 in above Thermal Stability section. 
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Both native and recombinant AAT appear resistant to low pH conditions in both 
PBS and formula. There were no differences between treatment groups and controls for 
pH 3 through 7, and controls or between the native and recombinant AAT according to 
SDS-PAGE, Western blots, and trypsin assay. However, the elastase assay and ELISA 
5 data show that recombinant AAT is more affected by acidic conditions than the native 
form. In PBS, native AAT was more than 95% intact, while about 60-80% of the 
recombinant AAT activity was intact. Infant formula may have a stabilizing effect on the 
recombinant protein, since it was found to be as stable as the native form according to 
ELISA and the Western blot. 

10 Native and recombinant AAT can withstand acidic and digestive conditions as 

assessed by SDS-PAGE, Western blots, ELISA and activity assay. Native AAT regains 
much of its structural and functional stability after treatment at acidic conditions followed 
by neutralization, whereas recombinant AAT shows some loss of activity at a wide pH 
range, which may reflect a different glycosylation pattern. The conditions of the infant- 

15 modeled digestion, pH 5 during pepsin treatment, are not ideal for pepsin, which 

normally possesses full activity at pH 2. AAT has been detected in human infant feces, 
which supports the notion that it is capable of surviving digestion in vivo, particularly 
during the first three months of the infantas life. This evidence also supports the validity 
of the in vitro digestion system. It is likely that AAT possesses enough resistance to 

20 acidic and digestive conditions to allow a significant amount to survive and affect the 
digestion process. 

Recombinant AAT remained functionally intact after being exposed to low pH, in 
vitro digestion, and several types of heat treatment. It is therefore possible that 
recombinant AAT may be added to infant formula, can tolerate some processing 

25 conditions, and remain intact in the gastrointestinal tract of infants. Thus, recombinant 
AAT may help protect other physiologically active proteins, such as lactoferrin and 
lysozyme, which also may be added in recombinant forms in the gut of formula-fed 
infants. In conclusion, addition of recombinant AAT together with other recombinant 
proteins may enhance their bioactivity and make the formula more similar to human 

30 milk. 



J. Expression of AAT in transgenic wheat 

The plasmid API282 containing the Bx7 promoter, Bx7 signal peptide and AAT 
gene, Nos terminator and ampicillin resistance gene was used to transform wheat cells, 
35 substantially as in the transformation of rice cells. Twenty one transgenic lines were 
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produced. Expression of AAT was determined to be about 5 to 12 jjg per grain of wheat 
seeds. 

EXAMPLE 6 

5 Generation and Characterization of Recombinant Proteins Produced by Transgenic Rice 

Plants 

A. Generation of recombinant antibodies 

Recombinant antibodies have been expressed in transgenic plants (for 
examples, see Peeters et a/., 2001; Giddings et at., 2000; Larrick et a/., 1998). 

10 However, expression and production of recombinant antibodies in the seeds of 

transgenic plants have certain advantages. The production of high levels of antibodies 
in grains, for example rice grains, provides distinct advantage that food supplements 
may be prepared with little or no purification, and other advantages that are illustrated 
herein the patent application. 

15 In one embodiment, an expression vector is constructed as illustrated in 

Example 1 that includes codon optimized nucleotide sequences encoding functional 
components of an antibody. For example, the components can be a heavy chain, a light 
chain, a linker region or a J chain and a secretory component. The expression vector 
may also include a promoter, a signal/target/transport sequence or sequences and a 

20 terminal sequence or sequences. Preferred promoter, signal/target/transport sequence 
and terminal sequence are exemplified herein. For example, for expression of each 
functional component of an antibody in rice seeds, a codon-optimized component gene 
is operably linked to the rice endosperm specific glutelin (Gt1) promoter, a Gt1 signal 
peptide and NOS terminator to form a component expression vector. 

25 Each component expression vector is introduced to rice celts and plants to 

generate antibody component-expressing transgenic rice cells and plants, as 
exemplified in Example 2. In one embodiment, the expression vectors containing 
antibody heavy chain, light chain, linker region or J chain, and a secretory component 
can be introduced individually. The plants expressing each individual component can 

30 be crossed to generate plants that express a functional antibody. 

In another embodiment, the expression vectors containing functional 
components of an antibody can be introduced to the plant at the same time, using the 
transformation methods exemplified in Example 2, such as by co-bombardment. A plant 
that expresses functional antibody is selected for further propagation. 
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In another embodiment, the expression vector containing codon optimized 
nucleotide sequence encoding a single chain antibody is introduced to rice cells and 
plants to generate antibody expressing transgenic rice cells and plants, as exemplified 
in Example 2. The nucleotide sequence encoding a single chain antibody can be 
5 constructed as conventional in the art, for example Kortt et a/., 2001 , Maynard and 
Georgiou, 2000; Humphreys DP and Glover, 2001. 

The plant-generated recombinant antibody can be isolated and purified as 
exemplified in the patent application. 

10 B. Generation of human EGF 

The Epidermal Growth Factor (EGF) gene was codon optimized as shown in 
Figure 20, and synthesized by Operon Technologies (CA, USA) with a SEQ ID NO: 8. 
The gene was cloned into pAP1 145 and pAPI241 respectively, as exemplified in 
Example 1. The resulting plasmids were named API270 {Figure 21) and API303 (Figure 

15 22), respectively. For expression of EGF in rice seeds, the codon optimized gene was 
operably linked to the rice endosperm specific glutelin (Gt1) promoter, Gt1 signal 
peptide and NOS terminator in pAPT303, and to the rice endosperm specific globulin 
(Gib) promoter, Gib signal peptide and NOS terminator in API270. The transgenic plant 
expressing EGF was generated, and plant-generated recombinant EGF was detected, 

20 as shown in Figure 23 and as exemplified herein. 

C. Generation of human IGF 

The Insulin-like Growth Factor (IGF) gene was codon optimized as shown in 
Figure 24, and synthesized by Operon Technologies (CA, USA) with SEQ ID NO: 9. 

25 The gene was cloned into pAPI145 and pAPI241 respectively, as exemplified in 

Example 1 . The resulting plasmids were named API271 (Figure 26) and API304 (Figure 
25), respectively. For expression of IGF in rice seeds; the codon optimized gene was 
operably linked to the rice endosperm specific glutelin (Gt1) promoter, Gt1 signal 
peptide and NOS terminator in pAPI304, and to the rice endosperm specific globulin 

30 (Gib) promoter, Gib signal peptide and NOS terminator in API271 . The transgenic plant 
expressing IGF was generated, and plant-generated recombinant IGF was detected as 
shown in Figure 27 and as exemplified herein. 

35 
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D. Generation of Other Expression Plasmids 

Other expression plasmids for use in transforming plants herein for the 
production of recombinant polypeptides in transgenic plants were made substantially as 
previously described. These plasmids are shown in Figure 28, showing API321, 
5 containing a Gib promoter, a Gt1 signal peptide, codon-optimized haptocorrin gene, Nos 
terminator, and an amipicillin resistance gene; Figure 29, showing API320, containing a 
Gt1 promoter, a Gt1 signal peptide, codon-optimized human haptocorrin gene, Nos 
terminator, and an amipicillin resistance gene; Figure 30, showing API292, containing a 
Gib promoter, a Gib signal peptide, kappa-casein gene, Nos terminator, and an 

10 amipicillin resistance gene; Figure 31, showing API297, containing a Gt1 promoter, a 
Gt1 signal peptide, a gene encoding mature kappy-casein polypeptide, Nos terminator, 
and an amipicillin resistance gene; Figure 32, showing API420, containing a Gt1 
promoter, a Gt1 signal peptide, lactohedrin gene, Nos terminator, and a kanamycin 
resistance gene; Figure 33, showing API418, containing a Gt1 promoter, a Gt1 signal 

15 peptide, lactoperoxidase gene minus the sequence encoding the propeptide, Nos 

terminator, and a kanamycin resistance gene; Figure 34, showing API416, containing a 
rice Gt1 promoter, a Gt1 signal peptide, codon-optimized lactoperoxidase gene, Nos 
terminator, and a kanamycin resistance gene; and Figure 35, showing API230, 
containing a Bx7 promoter, a Gt1 signal peptide, codon-optimized lysozyme gene, Nos 

20 terminator, and an amipicillin resistance gene; Figure 36A, showing API254, containing 
a Gib promoter, a Gib signal peptide, lactoferrin gene, Nos terminator, and an amipicillin 
resistance gene; Figure 36B, showing API264, containing a Gib promoter, a Gib signal 
peptide, human lysozyme gene, Nos terminator, and an amipicillin resistance gene; 
Figure 37, showing API225, containing a GT3 promoter, a Gt1 signal peptide, codon- 

25 optimized lysozyme gene, Nos terminator, and an amipicillin resistance gene; and 

Figure 38, showing API229, containing a RP-6 promoter, a Gt1 signal peptide, codon- 
optimized lysozyme gene, Nos terminator, and an amipicillin resistance gene. 

EXAMPLE 7 

30 Comparison of Promoter Activity in the Expression of Lysozyme in Transgenic Rice 
A. Comparison between Gt1 and Gib promoters and signal peptides 
In earlier studies, inconsistencies were observed between promoter activity of 
Gib and Gt1 from transient assay data and the protein accumulation level in transgenic 
plants bearing the same promoters with signal peptides. These unpublished studies 
35 suggested that post-translational regulation was involved in recombinant protein 
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expression and accumulation in the endosperm. It was unknown whether the storage 
protein signal peptide played a role in recombinant protein expression level or whether 
heterologous proteins could be sent to the protein bodies along the sorting pathways of 
native storage proteins. In order to improve the expression level of recombinant 
5 proteins in cereal crop seed, it is important to understand recombinant protein targeting 
and trafficking in the endosperm expression system. Hence, comparison was made 
between the rice storage protein promoters and signal peptides from the Glutelin-1 gene 
("Gt1") and the globulin gene ("G/b") showed that both promoters and both signal 
peptides were capable of effecting expression of lysozyme. 

10 (i). Storage proteins 

Rice endosperm contains four main storage proteins: acid-soluble glutelin, 
alcohol- soluble prolamin, water-soluble albumin and salt-soluble globulin (Juliana BO. 
Polysaccharides, proteins, and lipids of rice. Am. Assoc. Cereal Chem., St. Paul, MN 
(1985)). They are targeted into two types of protein bodies in rice endosperm. Prolamin 

1 5 aggregates within the endoplasmic reticulum ("ER") lumen into regularly shaped vacuole 
called protein body type I. The formation of these protein bodies is dependent on the 
chaperone BiP80 in the ER. Glutelin is deposited into protein storage vacuoles (PSV) 
via the Golgi apparatus into irregularly shaped vacuole called type II protein body. The 
components in the protein body type il and its sorting pathway are not well known. The 

20 targeting locations and sorting pathway of globulin and albumin also remain unknown. It 
appears that once the signal sequence is removed in the ER, the sorting and trafficking 
depend on the targeting information within the polypeptides and chaperones in the ER. 
,,v The sorting signals are divided into three categories: sequence-specific vacuole sorting 
signals (ssVSS), C-terminal vacuole sorting signal (ctVSS), and physical structure 

25 vacuole sorting signals (psVSS), as described in Frigerio L. et ai f Plant Physiol. 126: 
167-175 (2001); Matsuoka K. et aL, J. Exp. Botany 50: 165-174 (1999) and Vitale A. & 
Raikhel N.V., Trends in Plant Science 4: 149-155 (1999). 
(ii). Method 

Two promoters storage protein genes, Gt1 and Gib, and the corresponding 
30 glutelin-1 and globulin signal peptide coding sequences were used to express the 
human lysozyme protein in developing endosperm. In the three plasmids, pAPI264, 
pAPI159 and pAPI228, the human lysozyme gene was fused with the nucleotide 
sequences of the Gib promoter and globulin signal peptide coding sequences, the Gt1 
promoter and glutelin signal peptide coding sequences and the combination of the Gib 
35 promoter with the glutelin (GT1) signal peptide coding sequences, respectively (Figure 
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39A). Lysozyme amounts of T1 seeds were determined for 23 independently 
transformaht lines of pAPI264, 10 lines of pAPI159 and 7 lines of pAPI159. The 
transgenic lines of pAPI159, which synthesized lysozyme using the Gt1 promoter and 
the glutelin signal peptide, produced the enzyme in amounts ranging from 34.25 jug to 
5 297.23 fag»mg~ 1 total soluble protein (TSP) with an average of 133.76 ^ig-mg" 1 TSP. 
Plants transformed with pAPI264 carrying the Gib promoter and the globulin signal 
peptide yielded between 4.09 and 63.64 fig*mg" 1 TSP lysozyme with an average of 
33.96 fig»mg~ 1 TSP, while lines of pAPI228, which combined the Gib promoter and the 
glutelin signal peptide, yielded between 8.9 and 203.46 jjig^mg" 1 TSP with an average of 

10 87.70 ^xg-mg 1 TSP. 

The lysozyme expression amounts achieved with the Gt1 promoter+GT1 signal 
peptide was 3.94 fold higher than that with the Gib promoter+GLB signal peptide, while 
the expression amounts of lysozyme obtained with the Gib promoter+GT1 signal 
peptide was intermediate but increased 2.58 fold over that produced with the GLB signal 

15 peptide (Fig. 39B). Apparently the GT1 signal peptide is more efficient than the GLB 
signal peptide at lysozyme expression and deposition in rice endosperm. This 
demonstrates the importance of choosing an optimal signal peptide for the production of 
recombinant proteins in developing rice grains. 

(iii) . Chimeric Gene Components 

20 Time course of human lysozyme expression during rice endosperm development. 

We monitored lysozyme accumulation during endosperm development of transgenic 
lines 159-1-53-16-1 and 264-92-6. Immature spikelets were harvested at 7, 14, 21, 28, 
35, 42 and 49 days after pollination ("DAP"). The lysozyme amounts in the endosperms 
were measured by the activity assay. Lysozyme accumulation in the seeds of 

25 transgenic plant 159-1-53-16-1 began at 7 DAP and peaked at 21 DAP. Thereafter 

lysozyme content decreased until 35 DAP and then stabilized until seed maturity (Figure 
40). Lysozyme accumulation in developing seeds of the transgenic plant 264-92-6 
likewise began at 7 DAP, peaked at 28 DAP, after which lysozyme content steadily 
decreased through seed maturation (Figure 40). These results show that lysozyme 

30 accumulation in the two types of transgenic lines during endosperm development 
follows the same pattern as that of the native globulin and glutelin storage proteins. 

(iv) . Subcellular localization of human lysozyme in transgenic seeds 

In order to determine whether the recombinant lysozyme was targeted to protein 
bodies in the endosperm, we investigated its subcellular localization by 
35 immunofluorescence microscopy. Transgenic plant 264-92-6 synthesizing lysozyme 
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with the Gib promoter and globulin signal peptide and transgenic plant 159-1-53-16-1, 
producing human lysozyme with the Gf-f promoter and the glutelin signal peptide were 
analyzed. Dual localization with either native glutelin or globulin was used to determine 
the site of lysozyme deposition. 
5 Synthetic peptides derived from the amino acid sequences of rice glutelin and 

globulin were used to raise antibodies in rabbits. Antibody specificity was confirmed 
with Western blots of endosperm proteins. No cross-reaction of glutelin and globulin 
antibodies with other endosperm proteins was detected with the host TP309 or the 
transgenic lines 264-92-6 and 159-1-53-16-1 . The human lysozyme specific antibody 
10 detected the 13 kD of lysozyme protein exclusively in the fractionated and total protein 
extracts. 

Immature seeds from two transgenic lines, 159-1 -53-1 6-1 (T4) and 264-92-6(T2) 
and untransformed control, TP309, were harvested at 14 DAP and fixed and a 
comparable analysis was conducted. In transgenic line 264-92-6, strong 

15 immunofluorescence signals of lysozyme and native proteins were detected with fully 
overlapping pattern, both when lysozyme and globulin or lysozyme and glutelin were 
compared (data not shown). Merging the two separately recorded images produces a 
yellow pseudo color signal. A scan for green and red wavelength emission across the 5 
protein bodies along the white line in the not quite perfectly aligned images identifies the 

20 co-localization of human lysozyme with globulin. The orange tinge of the protein bodies 
is due to the stronger emission of red fluorescence than green. The perfect image 
merger provides a bright yellow color, and the recording of the green and red 
fluorescence emission along the white line identifies the same 5 protein bodies. These 
results demonstrated that lysozyme was colocalized in protein bodies with the native 

25 storage proteins. The results also demonstrated that the storage proteins globulin and 
glutelin are localized in the same cell compartment, substantiating the indication that 
globulin and glutelin are targeted into the same type II protein bodies in rice endosperm. 
We conclude that lysozyme contains all sorting information for protein body targeting, at 
least when co-expressed with rice storage proteins. 

30 The localization patterns of lysozyme and native storage proteins in 159-1-53-16-1 

are, however, quite different and more complex than those of transgenic line 264-92-6. 
In transgenic 159-1-53-16-1, lysozyme does not completely colocalize with the native 
storage proteins. Globulin localized preferentially in smaller, peripheral protein bodies in 
the younger cells of the cortical region from 14 DAP endosperm, while lysozyme 

35 localized preferentially in irregularly shaped protein bodies. However, lysozyme did 
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colocalize more evenly with globulin in the older cells of the central region from the 
developing endosperm. Merging the two separately scanned images visualized green 
fluorescing, lysozyme-rich type II protein bodies and red fluorescing, smaller, globulin- 
rich protein bodies. Recording of the red and green fluorescence emission along the 
5 white scanning line reveals that there is almost twice as much lysozyme in the large 
type II protein bodies as there is in the small protein bodies, while the globulin signal in 
the small protein bodies is 2-3 times observed in type II protein bodies. Thus, there 
appears to be a preferential targeting of the two proteins. In the central region of the 
endosperm, a more equal co-localization of lysozyme and globulin was observed, 
10 especially in the larger type II protein bodies, when judged by the intensity of green and 
red fluorescence, which provides the yellow color upon merging the two images. This is 
evident from the merged image scan at the two emission wavelengths. However, there 
are also small protein bodies containing a dominant portion of globulin or lysozyme in 
these cells. 

15 Distinct patterns were also found in 159-1-53-16-1 when anti-glutelin antibody was 

co-incubated with anti-lysozyme antibody in the younger cells of the cortical region from 
mid-developing endosperm. Like globulin, most of the glutelin localized in the smaller, 
peripheral protein bodies in younger cells, while lysozyme localized in irregularly shaped 
protein bodies. Lysozyme partially colocalized with glutelin in the older cells from the 

20 center region of mid-developing endosperm. Merging the two images and scanning for 
fluorescence-intensity at the two wavelengths reveals co-localization of the two proteins 
in the large and small protein bodies, some being highly enriched in lysozyme and 
others in glutelin. A comparable distribution is observed in the cells of the central part of 
the endosperm. The results suggested that lysozyme distorted the native storage 

25 protein targeting/sorting when under the control of the Gt1 promoter/GT signal peptide, 
producing high lysozyme expression, but not when under the control of Gib promoter/ 
GLB signal peptide with lower lysozyme expression. 

To determine if native protein accumulation was affected in the endosperm of the 
transgenic plants, we analyzed the amounts of glutelin, globulin and lysozyme proteins 

30 from two transgenic lines and TP309 by Western blotting. The results showed that 
glutelin protein was reduced in 159-1-53-16-1, but was increased in 264-92-6 in 
comparison to TP309. Amounts of globulin protein were reduced in both 264-92-6 and 
159-1-53-16-1. This change is particularly significant in the transgenic line 159-1-53-16- 
1 with its higher lysozyme expression level. The results showed that globulin was more 

35 affected than glutelin, no matter which signal peptide was used. 
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Based on the results, we conclude that lysozyme was targeted to the protein 
bodies and that the signal peptide played an important role in lysozyme expression. 
The plants with high expression levels of the recombinant protein showed distorted 
native protein expression and trafficking. m : 
5 Thus, the combination of the Gt1 promoter and Gt1 signal peptide was more 

effective than the combination of the Gib promoter and Gib signal peptide, with the 
combination of Gib promoter and Gt1 signal peptide having intermediate level of activity. 
Results showed that the high level expression of recombinant protein distorted the 
trafficking and sorting of the native storage proteins and affected the native storage 
10 protein expression. Results also indicated that mature human lysozyme protein 
contains a determinant recognized in the plant cell for the protein storage vacuole (PSV) 
sorting following signal peptide cleavage, and that the lysozyme was sorted to Type II 
protein bodies. 

15 B. Comparison of seven promoters and Gt1 signal peptide in regulating the 

expression of lysozyme 

Plasmids API159 (Gt1 promoter) (Figure 1), AP1228 (Gib promoter)( Figure 39), 
API230 (Figure 35), API229 (RP-6 promoter) (Figure 38), API225 (GT3 promoter), a 
plasmid carry the Glub-2 promoter, and another plasmid carrying the Club-1 promoter, 

20 were compared in their ability to effect the expression of lysozyme in transgenic rice T1 
seeds. Results shown in Figure 41 indicate that for expression of lyzoyme, Gt1 was the 
strongest promoter, followed by Gib, Glub-2, Bx7, Gt3, Glub-1 and Rp6, in order of 
promoter strength. 

25 EXAMPLE 8 

Co-transformation of Heterologous Polypeptide and Reb Gene in Transgenic Rice 

Plants 

A. Enhanced Lvsozvme Expression in Transgenic Rice Seed Co-Transformed 
with Reb 

30 Codon-optimized human lysozyme gene was linked to Gib promoter and Gib 

signal peptide to generate plasmid Glb-Lys (API264) as shown in Figure 36B, which was 
used to transform rice with or without Native-Reb, as previously described and as 
described in WO 01/83792. Normal plant phenotypes were obtained among 
transformants containing Glb-Lys alone or with Native-Reb, To determine the presence 

35 of Reb gene and Glb-Lys in the transgenic rice genome, one primer designed from 
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vector sequence and another designed from the Reb gene 3' terminator were used to 
identify these lines. In this case, only the recombinant Reb gene could be amplified. 
PCR analysis confirmed the presence of transgenes in the rice genome. Ten of 1 1 
plants from independent transformation events contained both Reb and the lysozyme 
5 transgenes. The REB protein of immature seeds from five randomly selected transgenic 
lines was detected by Western blotting. The expression level of the REB protein in 
transgenic lines ranged from 25% to 71% higher than that in untransformed TP309. 
This demonstrated that the transgenic Reb gene was active in transgenic plants. 

Seeds of confirmed transgenic rice plants were harvested at maturity, and the 

10 lysozyme activity was analyzed. As shown in FIG. 19, lysozyme expression in the 
seeds from 30 independent transformation events containing both the Native-Reg and 
the Glb-Lys ranged from 30.57 to 279.61 //g/mg TSP with an average of 125.75 + 68.65 
/yg/mg TSP. Seeds of 17 transgenic events containing the Glb-Lys gene alone 
expressed lysozyme in amounts ranging from 7 to 76 //g/mg TSP with an average of 

15 33.95 + 20.55 //g/mg TSP. No lysozyme activity was detected in untransformed rice 
seeds. The results showed that the expression level of lysozyme increased an average 
of 3.7-fold when seeds were transgenic for both the Re£> gene and Glb-Lys. Statistical 
analysis (t test) showed that the amount of lysozyme in seeds from the plants transgenic 
for the Reb gene and Glb-Lys is significantly higher than that in the plants transgenic for 

20 Glb-Lys alone (p < 0.001 ). 

B. Enhanced Human Lysozyme Expression in Transgenic Rice Seed Co- 
Transformed with Maize Transcriptional Factor. Prolamin-Box Binding Factor fPBR 
Three transcriptional factors were tested; rice endosperm bZIP protein (REB), 

25 Opaque2 (02) and PBF. The transcriptional factors and human lysozyme gene under 
the control of rice glutelin 1 (Gt1) or globulin (Gib) promoter were co-bombarded into 
rice callus. Transgenic Ri grains carrying both genes were obtained. The effect of 
transcriptional factors on the expression of human lysozyme were monitored. Under the 
control of Gib promoter, REB increased Lys expression by about 3-fold. REB showed 

30 no effect on a stronger promoter, Gt1 . Transcription factor increased Lys expression, 
but not significantly. PBF increased Lys expression on average 1 .5-fold over Gt1~Lys 
alone. The highest Lys-expressing lines were selected and advanced to R 2 generation 
in the greenhouse. As shown in Table 8 below, Lys expression level from an R 2 line, 
265/159-41-5, was about 190/yg per grain and 9.5 mg/gram of brown rice flour 

35 (equivalent to 0.8% grain weight). The level of expression was about 1.5-fold higher 
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than that of the highest expression line without the transcription factor. In addition, data 
showed that PBF not only increased the expression of Lys, but also increased the 
expression of native storage proteins such as glutelin and globulin, and the protein 
related to protein trafficking. It implies that PBF can act on the promoters of multiple 
5 genes to increase the expression of those proteins in rice endosperm. 
Table 8. 



Line Number 


* {^g/gram} 


R *(//g/grain) 
2 


R Lysozyme 
2 

(mg/g brown rice) 


Homozygous 


285/156-41 


150.23 


150.00 


5.5 


homozygous 


69-43 


114.35 


155.00 


9.0 


homozygous 


2S5/1 59-81 


103.58 


175.55 




heterozygous 


265/1 59-2S6 


152.07 


180.00 




heterozygous 



* The expression data were averaged from 10 seeds in R n and from 10 fines in R 2 . 



10 EXAMPLE 9 

Expression of heterologous polypeptide in transgenic rice under the control of barley 
aleurone- and embryo-specific promoters 
A. Construction of aleurone-specific promoter/GUS fusion plasmids 
To obtain Ltp1 and Chi26 promoters, PCR forward primers were designed to 
1 5 cover about 900 bp upstream region of each promoter and reverse primers were 
designed to include the whole 5' untranslated region. PCR amplification of the 5' 
flanking region of Chi26 from barley genomic DNA (Himalaya) gave a 882 bp PCR 
product, called PCR-Chi26. Sequence alignments of PCR-Chi26 to Chi26 DNA 
sequences in Genbank (accession number: L34210) showed deletions/insertions and 
20 nucleotide mismatches (FIG. 28A). The PCR-Chi26 promoter with 5' UTL was 

transcriptionally fused to the ^glucuronidase reporter gene ("GUS") followed by a NOS 
terminator, resulting in plasmid Chi26/GUS/NOS (FIG. 28B). 

Variations were observed in PCR fragments for the Ltp1 promoter as well. 
Among the seven varieties of barley, the size of PCR fragments from Harrington and 
25 Himalaya was about 500 bp while the expected 957 bp fragment was observed from 
Klages, Triumph, Golden Promise and Alexis. Both sizes of the PCR fragments were 
present in I202 (FIG. 29A). PCR fragment amplified from genomic DNA of barley variety 
Triumph was cloned and called PCR-Ltp1 . Sequence alignment of PCR-Ltp1 to Ltp1 
promoter in GenBank (Accession number: X60292) shows only four mismatches (FIG. 
30 29B). The PCR-Ltp1 with 5' UTL was transcriptionally fused to the ^-glucuronidase 
reporter gene with the NOS terminator, resulting in plasmid Ltp1/GUS/NOS (Fig. 29C). 
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B. Southern analysis of transgenic plants 

Transgenic plants containing each GUS chimeric construct were obtained from 
rice callus transformed by particle bombardment. A total of 13 transgenic plants were 
obtained for plasmid Chi26/GUS/NOS and 10 plants for plasmid Ltp1/GUS/NOS. These 
5 transgenic plants were confirmed by PGR analysis. To provide physical evidence for 
the integration of both plasmids into the rice genome, Southern analysts was carried out 
on representative transgenic rice plants. DNA from four rice plants transgenic for 
Chi26/GUS/NOS was digested with HindlW and EcoRI, which should generate a 3056 
bp fragment containing the PCR-Chi26 promoter, gus gene and the Nos terminator 

10 (FIG. 28B). All four plants contained the expected 3056 bp fragment while no signal 
could be detected in untransformed TP309 plants (FIG. 30A). Similarly, DNA from five 
rice plants transgenic for Ltp1/GUS/NOS was digested with Hindlll and EcoRI, which 
should generate a 2935 bp fragment containing PCR-Ltp1 promoter, gus gene and the 
Nos terminator (FIG. 29C). All plants contained the expected 2935 bp fragment, which 

15 could not be seen in untransformed TP309 plant (FIG. 30B). In both FIG. 30A and 30B, 
multiple integration of the GUS chimeric genes into the rice genome was observed. 
Multiple integration is a common phenomenon in transformation through the particle 
bombardment (Cao, J. et ai., Plant Cell Rep 11: 586-591 (1992); Christou, P. et al. t 
Biotechnology 9: 957-962 (1991); Gordonkamm, W.J. et al., Plant Cell 2: 603-618 

20 (1990).; Tada, Y. et al., EMBO J. 10: 1803-1808 (1991). 

C. Tissue-specific expression of GUS in transgenic rice 
Transgenic rice tissues were histochemically examined for cell-specific 

expression of gus. Positive GUS expression (blue color) was similar for all transgenic 
25 grains. Typical GUS expression derived from transgenic line 217-8-1 1 transformed with 
Chi26/GUS/NOS 1 and line 220-10-1, transformed with Ltp1/GUS/NOS was obserbed. 
GUS expression under the control of the Chi26 promoter was localized to the aleurone 
cells or cells near aleurone layers as evidenced by the blue colortherein (data not 
shown)). No GUS activity was observed in any other tissues such as leaf, root or stem. 
30 To determine if GUS expression was in the aleurone cells, GUS stained grains from 

transgenic rice containing Chi26/GUS/NOS were sectioned and examined under a dark- 
field microscope. Under this condition, GUS activity was indicated as red/pink color. In 
rice, there are one to two aleurone layers (thin aleurone layer) on the ventral side of the 
endosperm and four to six layers (thick aleurone layer) on the dorsal side which can be 
35 anatomically distinguished from other cells [Matsuo, T & Hoshikawa K, Science of the 
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rice plant. Food and Agriculture Policy Research Center, Tokyo (1993)]. The GUS 
expression from the CN26 promoter was observed only in cells in the thin or thick 
aleurone layers; GUS expression was not present in the embryo nor in the non-aleurone 
endosperm cells. This indicates that these anatomically differentiated cells in the thin or 
5 thick aleurone layers have the same molecular responsiveness to Chi26 promoter. No 
GUS activity was observed in untransformed rice plants. 

In contrast to the aleurone cell-specific expression controlled by the PCR~Chi26 
promoter, GUS expression under the control of the PCR-Ltp1 promoter was confined to 
the embryo tissue (data not shown). The whole embryo, including scutelum, of the 

10 grains from rice transgenic for Ltp1/GUS/NOS was GUS-stained with blue color. A few 
aleurone cells adjacent to the embryo cells express a low level of GUS. No other 
tissues such as leaf, stem or root showed GUS staining in the histochemical analyses of 
transgenic rice. Previously, limited data from Kalla R. et aL, Plant J. 6:849-860 (1994) 
showed that the Ltp1 promoter directed GUS expression in vascular tissue of transgenic 

15 rice stem. They did not show if there was any GUS activity in rice kernels nor did they 
show which portion of the Ltp1 promoter was used to control GUS expression. 

Results indicate that in transgenic rice the PCR-Chi26 promoter directs gene 
expression in rice aleurone cells and the PCR-Ltp1 promoter directs gene expression in 
embryo cells in transgenic rice. These promoters can be used for controlling the 

20 expression of recombinant heterologous polypeptides in aleurone cells* 

Expression of heterologous polypeptide in transgenic plants such as rice can 
increase the commercial food value of rice such as by expressing therein polypeptides 
of nutritional or health benefits, including those having therapeutic or anti-infective 
properties or activities, regulate the expression of genes responsible for pest or disease 

25 resistance in rice grains to improve rice crop fitness against biotic stresses in the 

environment, and can substantiate transgenic plants as a suitable expression system for 
production of any proteins, including industrial enzymes, and other uses of recombinant 
proteins.. The expression system can also be used for a metabolic engineering project 
such as the golden rice project [Ye, X.D. et al„ Science 287: 303-305 (2000)]. 

30 It is interesting to note that the two barley aleurone-specific promoters, Lpt1 and 

Cfr/26, are effective in driving expression of heterologous polypeptide in the aleurone 
and embryo of transgenic rice seeds. It is also of interest that these two barley 
aleurone-specific promoters show different expression patterns in transgenic rice in 
contrast to their similar expression in barley. The mechanism underlying the tissue 

35 specificity of the two promoters is possibly different. Through transient analysis, Leah et 
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al. (1994) analyzed the Chi26 promoter with a series of promoter deletions, mutations 
and gain of function experiments. Their analyses identified a 33 bp sequence between - 
179 and -147, so called the E-region, functioning as an enhancer in aleurone tissue and 
as a silencer in leaf tissue. The sequence of E-region was intact in the PCR-Chi26. 
5 The sequence of the E-region is missing from the PCR-Ltp1 promoter though the E- 
region might be present in Ltp1 as a trimeric dyad symmetry [Leah, R. et al., Plant J. 6: 
579-589 (1994)]. Therefore, the aleurone specific expression of the Ltp1 gene in barley 
is likely to be mediated by the promoter element (s) and transcriptional complex(es) 
which differ from those for Chi26 genes and this difference is reflected in transgenic rice. 
1 0 Along with the previous study of the barley Ltp2 promoter [Kalla, R. t (1994)], this 

expression study of the barley Chi26 promoter using transgenic rice plants indicates that 
a molecular mechanism controlling gene expression in the aleurone cells is conserved 
between barley and rice. 

15 D. Plasmid Construction 

Standard molecular manipulations and procedures were carried out as described 
in Ausubel et al. (1987) and Hwang, Y.S. et al., Plant Cell Rep 20: 647-654 (2001). 

The 882 bp promoter region with 5' untranslated leader sequence (UTL) of 
barley Chitinase 26 was PCR-ampfified from genomic DNA of barley variety, Himalaya, 

20 using the primers, Chi26/fw: 5' AACCCTCTCTGCAGTCACCTCCTGTGAAGT 3' and 
Chi26/rv: 5' CGGAGCGATCTAGATGTGCGAGCCAACAAA 3'. The 957 bp of the Ltp1 
promoter including the 5' UTL was PCR-ampiified from the genomic DNA of barley 
variety, Triumph, using the primers, Ltp1/fw: 5' 
CCGTATAAGCATGCTTGGAACAATCTCCAC 3' and Ltp1/rv: 5' 

25 GCGCGGGTCTAGATCACTTCTTAATCTGT 3'. 

All PGR amplification was carried out using the GeneAmp PCR system (model 
2400, Perkin-Elmer). PCR reactions were performed in 50 pi of PCR reaction buffer (50 
mM KCI. 1.5 MgCI 2 10 mM Tris-HCI, pH 9.0, and 0.1 %Trition X-100) containing 250 pM 
dNTP, 1 pg of genomic DNA as template, 0.1 pM each of forward and reverse primers, 

30 and 0.3 unit of Tag polymerase (Perkin-Elmer). After a two min pre-denaturing step at 
94 ° C, the reaction was run for 30 cycles with a 30 s denaturation at 94 0 C, a 30 s 
annealing at 58 0 C, and a two min extension at 72 0 C. The final extension step was 
five min at 72 ° C, followed by a 4 0 C soaking step. The PCR-amplified fragments were 
extracted with phenol:chloroform:isoamyl alcohol. The PCR product containing the 

35 Chi26 promoter and 5'UTL was digested with Pst I and Xba\ and subcloned into the 
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GUS expression cassette of pBl221 (Ciontech, CA) to generate Chi26/GUS/NOS 
(API217). The Ltp1/GUS/NOS construct (API220) was generated in the same way 
using a different set of restriction enzymes (Sph\ and Xba\). 

E. DNA sequence analysis 

Plasmid DNAs containing Ltp1/GUS/NOS (AP.I220, FIG. 28B) or 
Chi26/GUS/NOS (API217, FIG. 29C) were prepared with the Qiagen miniprep kit 
(Qiagen Inc. Valencia, CA) and the promoter regions of Ltp1 and Chi26 were 
sequenced by ABI automatic sequencer. 

F. Transformation 

The transgenic rice plants containing Chi26/GUS/NOS or Ltp1/GUS/NOS were 
produced as described in Chen, L. et al., Plant Cell Rep 18: 25-31 (1998). 

15 G. Southern blotting 

About three grams of green leaves were collected from T 1 transgenic plants and 
ground in liquid nitrogen into a fine powder.- The DNA was isolated according to the 
procedure of Dellaporta, S.L et al. (1983) and was purified by phenol/chloroform 
extraction. About 5 ^g of rice genomic DNA was digested with HindiW and EcoRI and 

20 used to prepare blots for Southern analysis. The southern blotting was performed as 
described in Ausubel et al. (1987). The ECL™ direct nucleic acid labeling and detection 
system (Amersham, USA) was used for analysis. 

H. GUS histochemical staining 

25 Mature T2 grains derived from all fertile transgenic rice T, plants were harvested 

and soaked in ddH 2 0 for one day at 4 0 C. These grains were sectioned longitudinally 
with a sharp razor blade into half-grain portions containing half-embryos. The sections 
were then stained with X-glucuronide solution for 24 hrs according to Jefferson, R.A. et 
aL, Plant MoL Biol. Rep. 5: 387-405 (1987). 

30 

I. Anatomy and microscopy 

The GUS stained half grain was fixed under vacuum overnight in a solution 
composed of 2.4 % glutaraldehyde, 0.3 % paraformaldehyde, and 0.025 M Pipes buffer. 
After fixation, the material was rinsed three times in 0.025 M Pipes buffer for 25-30 
35 minutes each and dehydrated using a graduated ethanol series of 10 - 100 % for 25-30 
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minutes for each step. This was followed by resin infiltration with 3:1, 1:1, 1:3 100 % 
ethanot: Historesin with each of these steps lasting for 24 hrs. Next the material was 
immersed in 100 % resin where it sat for 10 weeks. After the infiltration period, the 
material was transferred to an aluminum weigh boat for embedding in 10 ml of 
5 Historesin with 0.667 ml of hardener. Once the plastic had set, the material was cut out, 
trimmed and mounted onto wooden dowels for sectioning. The material was cut 2-3 |jm 
thick and imaged on an Olympus Vanox AH-2 using darkfield optics. 

Example 10 

10 Expression of heterologous polypeptide in transgenic rice under the control of a rice 

embryo-specific gene promoter 

A. Plasmid for Expression in Embryo & Aleurone of Transgenic Grain 
Plasmid pAPI224 was made in the following manner. A 246bp PCR fragment 

15 containing the first 130bp of the EmpH coding region was amplified from rice genomic 
DNA variety M202 with forward primer CATCCCCATCAGCTCAAGCCGC and reverse 
primer TGCATACCCTCGGCGAGGTTCTCCTGGGC. 

This PCR fragment consisting of 1 16bp 5' flanking region and 130bp of the Em 
cdding region was cloned into pCR-2.1 a TA cloning vector (Invitrogen). This PCR 

20 product was amplified and used to probe to a rice BAG library. A BAG clone, 35 B20, 
containing Em gene was isolated. 5.5kb of the EM promoter from a BAG clone, 35B20, 
was cloned into pBluescriptll KS+ using a Xhol digest. A 3.0kb Hindlll/NotI fragment of 
this plasmid was cloned into pCR2.1 a TA cloning vector. To this plasmid a 191 bp PCR 
product (digested with Notl/Xbal) containing the Em leader sequence was added. The 

25 3.0kb Em promoter along with the leader sequence was cloned into pBI221 replacing 
the 35S promoter using a Hindlll/Xbal digest resulting plasmid API224 which contains 
Em promoter, GUS gene and Nos terminator. 

The plasmid API224 was used to transform rice as described. The transgenic seeds 
were analyzed for GUS activity. It was found that GUS activity was located in embryo and 

30 aleurone layer only. 

B. Cloning of the rice Em protein gene Emp1 

The promoter region of the rice Emp1 gene upstream of the translation start site 
was cloned into plasmid API224 (FIG. 31) by conventional techniques. The cloning of 
35 the rice Emp1 gene was described in Lifts, J.C. et al. , Plant Mol. Bio. 19:335-337 

95 



3NSDOCID: <WO 



020647 B0A2 \_> 



WO 02/064750 PCT/US02/04909 

(1992). The Emp1 gene sequence from Lifts, J.C. et al. is reproduced herein as FIG. 
53. Plasmid AP1224 further contained the ^-glucuronidase reporter gene and Nos 
terminator. Rice cells were transformed with API224 as described previously and one 
transgenic line was studied in greater detail histochemically, as described above. 
5 As shown in FIG. 54, Empl /GUS expression was observed in transgenic line 

257-10-1 in the embryo (panel a), thick layer aleurone (panel b), and thin layer aleurone 
(panel c). No GUS expression was observed in the seeds of control untransformed rice 
plant Taipei 309 in any of the embryo (panel d), thick layer aleurone (panel e) or thin 
layer aleurone (panel f) tissues. 

1 0 Tissue specificity of GUS expression under the rice Emp1 promoter was also 

studied by histochemical analysis, substantially in the same manner as described 
above. As shown in FIG. 55, GUS expression was observed in transgenic line 257-10-1 
in the mature seed (panel a), and in the mature embryo (panel b). No GUS expression 
was observed in the leaf section (panel c), root section (panel d) or stem cross-section 

15 (panel e). No GUS expression was observed in the control untransformed rice seed of 
Taipei 309, used as control. 

EXAMPLE 11 

Production of rice extract containing recombinant proteins and its use in food 

20 A. General procedure for production of rice extract 

Transgenic rice containing heterologous polypeptides can be converted to rice 
extracts by either a dry milling or wet milling process. In the dry milling process, 
transgenic paddy rice seeds containing the heterologous polypeptides, such as 
recombinant human lysozyme or lactoferrin were dehusked with a dehusker. The rice 

25 was grounded into a fine flour though a dry milling process, for example, in one 

experiment, at speed 3 of a model 91 Kitchen Mill from K-TEC. Phosphate buffered 
saline ("PBS"), containing 0.135 N NaCI, 2.7 mM KCI, 10 mM Na2HP04, 1.7 rnM 
KH2P04, at pH 7.4, with or without additional NaCI, such as 0.35 N NaCI, was added to 
the rice flour. In some experiments, approximately 10 ml of extraction buffer was used 

30 for each 1 g of flour. In other experiments, the initial flour/buffer ratio varied over a 

range such as 1 g/40 ml to 1 g/10 ml. The mixture was incubated at room temperature 
with gentle shaking for 1 hr. In other experiments, the incubation temperature was lower 
or higher, such as from about 22°C to about 60°C, and the incubation time was longer 
or shorter, such as from about 10 minutes to about 24 hr. A Thermolyne VariMix 

35 platform mixer set at high speed was used to keep the particulates suspended. 
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In place of PBS, other buffers were used in some experiments, such as 
ammonium bicarbonate. In one embodiment, 10 liters of 0.5M ammonium bicarbonate 
was added to 1 kg of rice flour. 

The resulting homogenate was clarified either by filtration or centrifugation. For 
5 the filtration method, the mixture was allowed to settle for about 30 minutes at room 
temperature, after which the homogenate was collected and filtered. Filters in three 
different configurations were purchased from Pall Gemansciences and used. They 
were: a 3 jxm pleated capsule, a 1.2 fxm serum capsule and a Suporcap capsule 50 {0.2 
jim). For centrifugation, a Beckman J2-HC centrifuge was used and the mixture was 
10 centrifuged at 30,000 g at 4°C for about 1 hr. The supernatant was kept and the pellet 
was discarded. 

In one embodiment, the filtrate and supernatant were further processed, for 
example by ultra-filtration or dialysis or both to remove components such as lipids, 
sugars and salt. 

1 5 The filtrate from the above filtration procedure, which is also called the clarified 

extract, was then concentrated using a spiral wound tangential flow filter operated in a 
batch recirculation mode. In one embodiment, PES (polyethersulfone) 3000-4000 
molecular weight cutoff membranes was used for this step. These final concentrated 
extracts were held overnight in a cold room. 

20 The concentrated extracts were next dried to a powder by lyophilization. During 

loading of the lyophilizer trays, the extracts were not subjected to a final 0.2 or 0.45 
micron depth filtration to minimize loss of target proteins. The lyophilized material was 
scraped from the lyophilizer trays and combined into a plastic bag. The dry material 
was compressed by drawing a vacuum on the bag and then the material was blended 

25 and the particle size reduced by hand-kneading it through the plastic. 

The lyophilized materials were then suitable for use as an extract directly or in 
admixture with other food. In one experiment, the lyophilized materials were blended 
with various ingredients to produce control and test infant-formula. The ingredients 
were blended using a Hobart mixer (140 quart size) equipped with a paddle agitator. 

30 These final blends were packed in 1 kg double Mylar bags and the headspace was filled 
with nitrogen before sealing. 

Table 9 shows the recovery of recombinant human lactoferrin from 105 kg 
transgenic rice flour during each extraction step. The amount of recombinant human LF 
present was determined quantitatively as described in Example 4. 

35 
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Table 9. 





OLactoferrin 


Stage of process 


Lac mass 


% of max 


Baseline extraction yield 


4.0 mg/g flour 




Expected maximum 


420 g 


100% 


Initial extract 


338 q 


80 


Clarified extract 


373 g 


89 


Concentrated extract 


343 q 


82 


Dried extract 


340 g 


81 



Rice extract can also be produced using a wet milling procedure. Transgenic 
paddy rice seeds containing recombinant human lysozyme were re-hydrated for a 
5 period of 0 to 288 hrs at 30oC. The rehydrated seeds were ground in PBS extraction 
buffer. The initial seed/buffer ratio varied over a range such as 1 g/40 ml to 1 g/10 ml. 
Table 10 shows recovery of human lysozyme from rice seeds soaked from 0 to 288 hrs. 
Table 10. 



Rehydration time (hrs) 


Lysozyme (ug/grain) 


Recovery (%) 


0 


87 


100 


48 


69 


79 


60 


79 


91 


168 


60 


69 


216 


56 


64 


288 


58 


67 



10 Over 60% human lysozyme was recovered from the wet milling process. The 

result of the wet milling becomes initial extract which may be stored frozen until use. 
The processing of initial extract to obtain dried extract was the same as that described 
for dry milling in this section. 

15 B. Concentration and diafiltration of recombinant lysozyme and control rice 

extracts 

The conditions used in concentration and diafiltration varied depending on 
volume, speed, cost, etc. These conditions are all routine in the art based on the 
description herein. The frozen initial extract was thawed in the coldroom (about 2-8°C) 
20 for six hours. The thawed material were clarified though a 0.45}im filter and 
concentrated using a 5000 Nominal Molecular Weight Cutoff membrane of 
Polyethersultone. 

90 ml of the filtrate of control extract wa%concentrated to 1 0 ml and additional 1 0 
ml of deionized water was added to the concentrated filtrate. The diluted filtrate was 
25 diafiltrated one more time using water. The precipitate started forming at 16 mS and 
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increased as the ionic strength decreased. 1 M ammonium bicarbonate was added to 
the retentate to add ionic strength. The haze decreased although did not disappear 
completely. The materia! was diafiltered multiple times, in one embodiment three times, 
with water and multiple times, in one embodiment three times, with 0.1 M ammonium 
5 bicarbonate. It was concentrated to 9 ml and the membrane was rinsed with 0.1 M 
ammonium bicarbonate. The concentrate was filtered through several 0.2 \irr\ button 
filters. In one embodiment, 2.3 ml of the filtrate was lyophilized as is; 2.3 ml of the 
filtrate was diluted to 12 ml with deionized water and lyophilized, and 2.0 ml of the filtrate 
was diluted to 25 ml with deionized water and lyophilized. All remained clear. 

10 A total of 89 ml of the filtrate of rHLys extract was concentrated to 10 ml, and 

additional 10 ml of 0.1 M ammonium bicarbonate was added. The resulting mixture was 
concentrated back to 10 ml and another 10 ml of 0.1 M ammonium bicarbonate was 
added. The retentate started to haze up. The materia! was diafiltered multiple times, in 
one embodiment three times, with 0.1 M ammonium bicarbonate. It was concentrated 

15 to 9 ml and the membrane was rinsed with 0.1 M ammonium bicarbonate. The 

concentrate was filtered through several 0.45 p,m button filters. In one embodiment, 2.0 
ml of the filtrate was lyophilized as is; 2.0 ml of the filtrate was diluted to 12 ml with 
deionized water where a haze formed, and lyophilized, and 2.0 ml of the filtrate was 
diluted to 12 ml with 0.1 M ammonium bicarbonate which remained clear, and 

20 lyophilized, 

C. Comparison of trial extraction of recombinant Ivsozyme rice with PBS and 
ammonium bicarbonate 

The conditions used in concentration and diafiitration varied depending on 

25 volume, speed, cost, etc. These conditions are all routine in the art based on the 

description herein. rHIys rice flour was mixed with extraction buffer at about 100 g/L for 
about 1 hour using a magnetic stir bar. In one 2 liter beaker, the extraction buffer was 
PBS, pH7.4 plus 0.35 M NaCI. In another 2 liter beaker, the extraction buffer was 0.5 M 
ammonium bicarbonate. A 15 cm buchner was pre-coated with about 6g of Cel-pure 

30 C300 before adding another 20g of Cel-pure C300. The mixture was filtered at about 
3-4 Hg. It was then washed twice with about 100 ml of respective extraction buffer. The 
extracted filtrate was collected and concentrated with ultra-filtration cartridges: 5K 
Regenerate Cellulose, 5K PES, and 1K Regenerated Cellulose. The concentrates were 
lyophilized and analyzed for rHIys contents. The ammonium bicarbonate and PBS, 

35 pH7.4 plus 0.35 M NaCI both extracted approximately the same amount of rHIys. There 
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was little loss of lysozyme units in the permeate with any of the ultrafiltration units that 
were used. 

Other extraction buffer can also be used to extract recombinant proteins 
expressed in transgenic rice grains, for example Tris buffer, ammonium acetate, 
5 depending on applications. For example, for using recombinant human LF for iron 
supplement, iron may be added to the extraction buffer and the buffer is set at a pH so 
that the apo-LF can pick up iron during the extraction process. Under this condition, LF 
can become saturated with iron (hoto-LF). In another example, a buffer lacking of iron 
and a pH resulting in iron release from LF is used to produce apo-LF. 

10 

D. Production of rice extracts containing recombinant proteins 
The conditions used in concentration and diafiltration varied depending on 
volume, speed, cost, etc. These conditions are all routine in the art based on the 
description herein. All equipment was soaked in hot 0.1M NaOH at a starting 
15 temperature of about 55°C. Rice flour was added to an about 250-500 gal. stainless 
steel tank containing 0.5M ammonium bicarbonate at about 95-105 g/L. It was mixed 
for about 60-80 minutes at about 9°C. 

12 plates of 36 inch filter press C300 were pre-coated with about 3-6 kg Cel-pure 
C300. About 19-26 g/L of Cel-pure was added to the extract and mixed thoroughly. The 
20 mixture was pressed at a pressure of about 22 psi at a flow rate of about 82 

liters/minute. The filtrate was collected into a 250 gal. stainless steel tank and washed 
with 0.5M ammonium bicarbonate. The press was blown dry. The process was carried 
out at about 10°C. 

The 300 NMW cut-off membranes (Polysulfone) which had been cleaned and 
25 stored with 0,1 M NaOH after control run was rinsed thoroughly with deionized water. 
The extract was concentrated and bumped to a 100 gal stainless steel tank. The 
membrane and the concentration tank were flushed with 0.1 M ammonium bicarbonate 
to recover all of the products. The product were covered with plastic and left in the 100 
gal tank overnight at room temperature. The concentrate was filtered through spiral 
30 wound 1 jim filter and into 5 gal poly container. The concentrate was lyophilized. About 
81% of lactoferrin and about 58% of lysozyme was recovered from transgenic rice 
grains, respectively. 

35 
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E. Blending of rice extract containing recombinant proteins into infant formula 
The three types of lyophilized dry extract that contains rice proteins (control) or 

rice proteins with lysozyme or lactoferrin were combined with standard infant formula. 

The blending was done such the final infant formula contained about 1 gram lactoferrin 
5 and 0.1 gram lysozyme per liter of infant formula. The ingredients were blended using a 

Hobart mixer (140 quart size) equipped with a paddle agitator. These final blends were 

packed in 1 kg double Mylar bags and the headspace was filled with nitrogen before 

sealing. 

Samples of infant formula containing human lysozyme and lactoferrin were 
10 quantified using procedures described in Example 3 and 4. 

Table 11 shows human lysozyme and lactoferrin in infant formula. 
Table 11. 



Infant Formula 


Lactoferrin (mg/ ml) 


Lysozyme (mg/ml) 


With control rice extract 


0.0 


0.0 


With transgenic rice extract 


1.03 


0.13 



15 Using extract as a delivery method of recombinant protein has clear advantages 

over the purified form or in the whole grain. The conventional approach, such as in the 
whole grain form, has limitations such as protein stability during high temperature and 
pressure processing. Furthermore, the purification approach is expensive. Therefore 
the extract approach 1 ) maintains a low cost compared to purification approach; 2) 

20 requires much smaller volume, for example about 1-10% of whole grain weight; 3) 

increases the concentration of recombinant protein from about 0.05-0.5% in whole grain 
form to about 10 to 20 % in the extract form. Some extract form even reaches 40% 
depending on the expression level of recombinant protein. Therefore, the extract 
approach will allow broader application of the recombinant proteins compared to the 

25 whole grain approach. In addition, the extract approach removes starch granule, which 
requires high gelling temperature, for example about 75°C. Consequently, the extract 
approach provides more flexibility in processing the rice grain and the recombinant 
proteins into food and diet, and the alike, without worrying about using high temperature 
to denature starch granule. The undenatured starch granule cannot be digested by 

30 human gut without gelatinization by for example high temperature. 
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EXAMPLE 12 

Plant Transcription Factors 
A. Cloning Of The Reb Gene From A Rice BAC Library 
Reb was cloned from a rice BAC library. The Reb gene including the introns, 
5 promoter and 3 - UTR region is 6,227 bp long, comprises 6 exons and 5 introns and is 
flanked by a 1 .2 kb 5* promoter and a 1 .2 kb 3-terminator region. The function of the 
Reb gene was explored using effector constructs containing the Reb gene together with 
the native Reb promoter and fusion genes linking Reb to the rice actin (Act) or globulin 
(Gib) gene promoters. (See Fig. 6A-B.) 

10 PGR primers were designed based on the Reb gene sequence provided in 

Nakase, 1997 and used to screen a rice bacterial artificial chromosome (BAC) library 
[Yang, 1997] using a screening strategy for tri-dimensional DNA pools of the BAC library 
as described by Xu, 1998. 

PCR was carried out with 100 ng pooled BAC DNA, 10 mM Tris (pH 8.3), 1.5 

15 mM MgCI 2 , 50 mM KCI, 0.5 jaM dNTP and employed a program of denaturing at 94 °C 
for 5 min followed by 30 cycles of 94 °C for 45 seconds, 60 °C for 45 seconds and 72 °C 
for 1 min, using a forward primer (S'^TTGATATGTGCCCATGTTCCAAAC-S') and a 
reverse primer (5-CCTTGCTGAATGCAGATGTTTCAC -3'). The plasmid DNA of a 
positive BAC clone was prepared as described [Yang, 199*7], the BAC DNA digested 

20 with Hind\\\ and the presence of the Reb gene confirmed by Southern analysis 
[Sambrook J., 1989]. 

The Reb gene was retrieved from the BAC by subcloning two fragments into the 
pBluescript KS+ vector (Stratagene, CA). First, the promoter and partial coding region 
was obtained as a Kpn\-Hind\\\ fragment, followed by a second step where a Hind\\\ 

25 fragment containing the remaining coding region and the 3* terminator region was 

obtained by shut-gun cloning. The two fragments were ligated at the internal Hind\\\ site 
generating an intact Reb gene and the complete Reb gene was generated by ligating a 
1,775 bp fragment containing the promoter and the 5' coding region, to a 4,452 bp 
fragment containing the 3' coding and terminator region. 

30 The Reb DNA was sequenced with an automatic DNA sequencer (ABI 371) 

which revealed 5 introns, 6 exons, 1.16 kb of the 5' promoter sequence and 1.2 kb of 
the 3' region totaling 6,227 bp. (Fig. 2A-I). A comparison of the open reading frame of 
the isolated Reb gene with the Reb cDNA gene found at GenBank Accession No. 
ABQ21737 revealed 99.97% DNA sequence similarity and 99.99% amino acid similarity 
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resulting from two amino acid changes: lie 165 ->Asn and Glu 2 i 5 ->Lys. These differences 
are likely to be due to polymorphisms among rice varieties. 

B - Opaque2 (02) and Prolamin Box Factor (PBF) 
5 The nucleic acid sequence for the Opaque2 transcription factor found for 

example at GenBank Accession number X15544 (opaque2 gene), M29411 (opaque2 
cDNA) was cloned into an expression vector under the control of the rice glutelin-1 (Gt- 
1 ) promoter (Fig. 1 1 C, maize). 

The rice PBF and maize PBF coding sequences found for example at GenBank 
10 Accession numbers D1 1385 (rice cDNA) and ZMU82230 (maize cDNA) were also 

cloned into an expression vector under the control of the rice gluteiin-1 (Gt-1) promoter. 

The DNA sequence of the rice (Oryza sativa) globulin promoter, ("Gib") with 
putative binding sites for the 02 transcription factor and the prolamin box and the DNA 
sequence of the wheat Bx7 promoter with putative binding sites for the 02 transcription 
15 factor and the prolamin box are shown in Figs. 9 and 10, respectively. 

Promoters were digested to produce the appropriate cohesive ends and cloned 
into compatible sites in a reporter construct. In one example, the reporter construct is 
comprised of the rice globulin (Gib) or wheat Bx7 promoter translationally fused with 
GUS, with the resulting constructs designated, Glb/GUS and Bx7/GUS (Fig. 12A), 
20 The effect of plant transcription factors on the gene expression was evaluated by 

co-transformation with a heterologous nucleic acid construct effective to express the 
transcription factor, e.g. 02, PBF, BPBF or Reb. 

EXAMPLE 13 

25 Transient Expression Assays with the Reb Transcription Factor 

A. Plasmid Construction 

Plasmids were constructed using standard molecular biological techniques as 
described in Ausubel et a/., 1987. Plasmids API212 (Glb-GUS) carrying the rice globulin 
gene promoter fused to the GUS reporter gene and AP1 142 (Gt1-GUS) containing the 

30 rice glutelin 1 gene promoter fused to the GUS reporter gene were used for transient 
expression assays. The globulin promoter (GenBank Accession number X63990) and 
glutelin 1 promoter (GenBank Accession number Y00687) were obtained from M202 
(Oryza sativa Joponica subsp.) by amplification with PCR. The plasmid containing the 
Reb gene under the control of its native promoter was designated pAPI267 (Native- 

35 Reb). The plasmid designated Glb-Reb (pAPI266) was prepared by cloning the Reb 
coding region (NruUSacl fragment) into the Glb-GUS plasmid after removal of the GUS 
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gene by digestion with Smal/Sacl, thus replacing the GUS gene with the Reb gene. 
Using the same strategy, the plasmid designated Actin-Reb (pAPI277) was made by 
replacing the GUS gene of plasmid Act1-D-GUS (McElroy, 1990) with the complete Reb 
gene (Nrul/Sacl fragment). The Act1-D promoter was kindly provided by Professor Ray 
5 Wu, Cornell University (McElroy, 1990). 

B. Transient Assay using Rice Endosperm 

Rice spikelets with immature endosperm (7-9 days after pollination, dap) of 
M202 (Oryza sativa Japonica subsp.) were collected from plants grown in the 

10 greenhouse at 30°C. The spikelets were sterilized with 70 % ethanol for 10 min. After 
evaporation of residual ethanol, the endosperm was dissected and 10 immature 
endosperms placed on a filter paper in a Petri dish containing AA medium [Chen et a/., 
1 998] supplemented with 20mM ammonium nitrate. 

Fifty nl of gold particles (60mg-mr 1 at 1 :1 ratio of 1 .0 and 1 ,5-3.0 urn diameter 

1 5 gold particle) were coated with 5 jag DNA consisting of a mixture of the reporter gene, 
the effector gene, and the internal control gene, typically at a molar ratio of 1 : 1 : 1 . DNA 
coating was accomplished as described in the instruction manual of the Biolistic PDS 
system (Bio-RAD, Hercules, CA, USA). pAHC18 containing the luciferase gene driven 
by an ubiquitin promoter [Christensen et a/., 1996] was used as an internal control. In 

20 tests without the effector gene, pAHC18 was replaced by pBluescript DNA. Particle 
bombardment was carried out with a biolistic Helium gun device at 1 100 psi (Biolistic 
PDS 1000/He system, Bio-Rad, Hercules, CA, USA). After bombardment, the immature 
endosperms were incubated at 25°C for 24h in 5 ml of AA medium supplemented with 
20 mM ammonium nitrate, 50jug*mr 1 cefotaxin and 50jig-mr 1 timentin (Sigma, Louis, 

25 MO, USA) to prevent bacterial growth. The endosperms were then harvested and 
ground with 55 pJ extraction buffer (0.1 M potassium phosphate, pH 8.0, 1 mM EDTA, 
10 mM DDT, 5 % glycerol, 0.2mM leupeptin and 0.2 )iM phenylethylsulfonyl fluoride 
(PMSF). The extract was centrifuged at 25,000 g for 5 min at 4°C. From the 
supernatant, a 20 pi aliquot was added to 180 jal of luciferase assay buffer (0.25 M 

30 Tricine, pH 7.8, 150 mM magnesium chloride, 10 mM ATP, 1 mM DDT and 100 p.g ml" 1 
BSA). Another 20 \±\ aliquot was added to 200 \x\ of GUS assay buffer (Tropix, Bedford, 
MA, USA). Luciferase activity was measured after incubation at 25°C for 20 min and p- 
glucuronidase activity after 1h at 37°C for 1h with a Monolight 2010 chemi-illuminometer 
according to the manufacturer's instructions (Analytical Luminescence Lab., Monolight, 

35 San Diego, CA, USA). (3-glucuronidase activity was normalized to the luciferase activity 
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and expressed relative to the activity of the GIb-GUS. In general, the data presented 
reflect an average of at least six assays of two independent experiments. 

C, Transcriptional Activation with Reb 

5 The function of Reb gene was analyzed by the transient assay with rice 

immature endosperm. Transient expression studies were carried out to evaluate the 
effect of Reb on expression of GUS (/^-glucuronidase), under the control of the Gib 
promoter 

The results shown in Fig. 7B indicate that GUS expression was increased when 
1 0 Reb was expressed under the control of the globulin promoter, the actin promoter or the 
native Reb promoter suggesting that Reb is effective to an activate expression mediated 
by Gib promoter. 

Putative Reb binding sites in the globulin (Gib) promoter were identified as 
shown in Fig. 4. 

15 In order to determine, if binding of the Reb protein to the motif 

(GCCACGT(A/C)AG) in the globulin (Gib) gene promoter activates transcription of this 
promoter, plasmids containing fusions of the Reb coding region with the Gib promoter 
and the rice actin (Act) gene promoter were prepared (Fig. 7A). These as well as the 
expression plasmid containing the native Reb gene (pAPI266) were co-bombarded into 

20 the rice endosperm with a plasmid containing the GUS reporter gene driven by the Gib 
gene promoter and an internal control plasmid containing the luciferase gene driven by 
the ubiquitin promoter. 

Setting the level of GUS expression by the globulin gene promoter as 1, the co- 
delivery of the plasmids containing the Reb gene increased GUS expression 

25 irrespective of whether the gene was driven by its own promoter or the Gib promoter or 
the Actin promoter (Fig. 7B). The increases were 2.43, 2.01 and 1.98 fold, respectively. 
The activation of GUS expression was abolished when a promoter-less Reb construct 
was co-bombarded with GIb-GUS (Fig. 7B). These results suggested that the Reb 
protein functions as a transcriptional activator of the Gib promoter. 

30 

D. Identification of the Upstream Activation Sequence (UAS) for Reb 

Using a band-shift assay, Nakase et a/., 1997 have shown that Reb binds to two 
motifs, GCCACGTAAG or GCCACGTCAG. An analysis of the Gib promoter sequence 
revealed two copies of GCCACGTAAG and one copy of GCCACGTCAG clustered in 
35 the sequence region -700 bp distal to the TATA box of the promoter (Fig. 4). 
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In order to determine whether the binding motifs for Reb signify an upstream 
activation sequence (UAS), 200 bp of the Gib promoter, which contains the three motifs 
located at positions -642 to -842 distal to the TATA box were deleted from the Gfb 
promoter (Fig. 8A). The deletion was demonstrated to have no effect on the expression 
5 of GUS as both the Glb-GUS and the GlbAUAS-GUS (UAS deleted) constructs showed 
the same level of background GUS expression (Fig. 8B). 

When Native-Reb was co-bombarded with GlbAUAS-GUS, the transcriptional 
activation by Reb which was evident when Native-Reb was co-bombarded with Glb- 
GUS was lost (Fig. 8B). These data indicate the transcriptional activation of Glb-GUS 

10 by Reb occurs through this 200bp fragment containing Reb binding motifs. 

A scan of the rice gluteiinl (Gt1 ) promoter sequence did not reveal the presence 
of Reb binding motifs. Accordingly, Gt1 was selected as a candidate for the introduction 
of the UAS from the Gib promoter in order to test for gain of the Reb response function. 
Heterologous nucleic acid constructs were prepared containing the native Gt1 promoter 

1 5 linked to the GUS gene (Gtl-GUS), and a Gt1 promoter modified to contain a 98 bp Reb 
UAS fragment containing 3 copies of GCCACGT(C/A)AG (amplified from the Gib 
promoter) was inserted at position -630 bp distal to the TATA box of the Gt1 promoter in 
order to generate Gt1+UAS-GUS (Fig. 9A). 

When Gt1-GUS was tested by co-bombardment of developing endosperm with 

20 the native Reb gene and Gt1 -GUS, the results showed that Reb does not activate the 
Gt1 promoter (Fig. 9B). Gt1+UAS-GUS was tested for GUS expression and it was 
shown that addition of the Reb UAS to the Gt1 promoter did not increase its capacity for 
GUS expression significantly (Fig. 9B). However, when Gt1+UAS-GUS was tested by 
co-bombardment of developing endosperm with the native Reb gene a 2.5 fold increase 

25 in GUS activity was obtained (Fig. 9B). 

The Reb protein was previously described as a transcription factor. The results 
described herein show that (1) Reb is a transcriptional activator, as evidenced by a 2.0 
to 2.5-fold increase in GUS activity when Reb effector constructs were co-transferred 
with the reporter uid A gene encoding GUS under the control of the Gib promoter into 

30 immature rice endosperm cells; (2) Reb specifically activates the Gib promoter but not 
gluletin gene family promoters; (3) Reb interacts with an approximately 100 bp upstream 
activation sequence (UAS) containing the motifs GCCACGT GAG and GCCACGTAAG 
(GCCACGT(A/C)AG) of the Gib promoter, as confirmed by loss-of-function and gain-of- 
f unction experiments. The loss of activation function, when the 200bp fragment 

35 containing the Reb UAS is removed from the Gib gene promoter, and the gain of this 
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function, when the 98bp fragment with Fteb UAS is added to the Gt1 promoter, 
establishes the 98bp fragment as an upstream activation sequence (UAS). 

EXAMPLE 14 

5 Transient Expression Assays with the Q2 and PBF Transcription Factors 

A. Plasmid Construction 

Plasmids were constructed using standard molecular biological techniques as 
described in Ausubel et a/., 1987. A 693 bp segment of the Gt1 promoter sequence 
was removed from pGtl v3.0 SDM, a Gt1 expression vector, and used to replace the 

10 CaMV 35S promoter in pBI221 through Hind\\\ and Smal sites, resulting in 

Gt1/GUS/NOS. The promoter regions from the rice glutelin genes Gf3, GluB-1 and 
GluB-2; the rice prolamin genes RP6 and PG5a\ the rice globulin gene Gib; and the 
wheat glutelin gene, 6x7; were PGR amplified from M202 genomic DNA or the wheat 
variety, Anza, for Sx7. 

15 The PCR amplifications were carried out using the GeneAmp PGR system 

(model 2400, Perkin-Elmer) operated according to the manufacturers instructions. 
Basic cycling conditions were 30 cycles, after a 2 minute pre-denaturing step at 95 °C, 
with a 30 second denaturing step at 95 °C, a 30 second annealing step at specific 
temperature, and a 2 minute extension step at 72 °C. The final extension step was 5 

20 minute at 72 °C, followed by 4 °C soaking step. Reaction components per 50 pi volume, 
were 1 pg of genomic DNA or 1 ng of plasmid DNA, 2.5 pi of 5 pM primer mixture, 5 pi 
of 10 mM dNTP, 2.5 units of Tag polymerase (Perkin-Elmer), 5 pi of 10X PCR buffer 
(Perkin-Elmer). The concentration of MgCI 2 was 1.5 mM, for all the promoters with the 
exception of the Bx7 promoter for which 2.5 mM MgCI 2 was used. All the PCR primers 

25 and amplified fragment sizes are presented in Table 12. 

Each of the PCR amplified promoter sequences was cloned into the GUS 
cassette of pBI221 through Pst\ and Xba\ sites to give Gt3/GUS/NOS, GluB- 
1/GUS/NOS, GluB-2/GUS/NOS, RP6/GUS/NOS, PG5a/GUS/NOS, Glb/GUS/NOS, 
Bx7/GUS/NOS, for the Gf3, GluB-1 t G/uB-2, PG5a t RP6> Gib and Bx7 promoters, 

30 respectively. 
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Primers 


Primer sequences 


PCR 

amplified 
fragment 


Annealing 
temperature 


Gt3/fw 
Gt3/rv 


GTTAGTcTG CAgTGT AAGTGTAGCTTC 
ATGGTTGtCtaGa I I I I GTGGGACTGAAC 


856 


58°C 


GluB- 
1/fw2 
GluB- 
1/rv2 


ACAGACAGcTGcAGAGATATGGATTTTC 
TAAG 

GGAACTCtCtAgAGCTATTTGTACTTGCT 
TATG 


1319 


62°C 


GluB-2/fw 
GluB-2/rv 


TCCGAGctgcAGTAATGGATACCTAGT 
GT AGTTt CtAg AG CT ATTAG C AGTTG C 


1028 


58°C 


PG5a/fw2 
PG5a/rv2 


CGGTGcTGcAGATGGGTTGGGAACCCT 
ATGATCTaqATTGCTCTGGGACATAGAT 


874 


58°C 


RP6/fw 
RP6/rv 


AATTCCTg CagC ATCGGCTT AGGTGTA 
TGATCTagATTGTTGTTGGATTCTACT 


684 


58°C 


Osglb/fw2 
OSqlb/rv 


GGCGCCTGcAGGGAGGAGAGGGGAGA 
GAT 

ACCTTG CTctag ATTG ATGATCAATCAG A 


997 


58°C 


Bx7/fw2 
Bx7/rv 


CGTCGTCTcTGcAGGCCAGGGAAAGAC 
AATG 

CGCTTAtCtAgaTCAGTGAACTGTCAGTG 


993 


62°C 



All transcription factor ("effector") plasmids where the coding sequences were 
5 placed under the control of the CaMV 35S promoter were generated by subcioning 
cDNA fragments of either 02, o2676 or PBF downstream of the CaMV 35S promoter 
and the Adh1 intron and upstream of the nos 3' end in pMF6. For 02, the BglU 02A1 
cassette described by Schmidt et al., 1992 (which removes the start codons for the 
three small ORFs present in the 5 1 leader sequence, thus promoting increased 02 

10 expression), was inserted into the BamHl site of pMF6. The 02676 effector was 

generated as described for 02 except that an internal restriction fragment containing the 
02676 point mutation (Aukerman, 1991) was substituted for the corresponding 
restriction fragment in BglU 02AI cassette. For PBF, a BamH\-Xho\ fragment containing 
the entire PBF cDNA (VicenteCarbajosa, 1 997) was subcloned into the same sites of 

15 pMF6. 02A1 and PBF were also expressed under the control of the maize UBI1 

promoter and first intron by subcioning the respective cDN A clones into the BamHl site 
of the pAHC17 plasmid (Christensen et al., 1996). 

Figures T1A-C are a schematic depiction of plasmids containing the (A) barley 
prolamin box binding factor protein (BPBF), (B) maize prolamin box binding factor 

20 protein (PBF) and (C) the maize opaque2 binding protein (02) transcription factor 
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coding sequences under the control of the rice endosperm-specific glutelin promoter 
(Gt-1 ). 

The construction of antisense plasmids for the 02 and PBF DNA binding 
domains was carried out using PGR primers designed to amplify the highly conserved 
5 region of DNA binding domains of 02 and PBF. The PCR primer sets for 02 were 
(M02/fw: 5 , -TTCTGGGATCCAAGATGGCTACCGAGG~3 , ) and M02/rv: 5'- 
GGGGTCGGATCCGAGATGGGCATGGAC-3', and for PBF (PBF/fw: 5'- 
AGTGGGGATCCTAAGCCGAGGCCGCAAC-3' and PBF/rv: 5'- 

G GTAG G G G ATC CTG GTG CAT AG GT AG C A-3* , resulting in amplification of 333 bp and 
10 278 bp amplification fragments, respectively when Ubi:02 and Lfo/:PBF were used as 
the template. 

The PCR reactions were performed in 50 pi of 1X PCR reaction buffer (50 mM 
KCI, 10 mM Tris-HCt, pH 9.0, and 0.1 % Triton X-100) containing 250 pM dNTP, 1 ng of 
template, 0.1 pM of each of forward and reverse primers, and 0.3 unit of Tag 

15 polymerase (Perkin-Elmer). The GeneAmp PCR system (model 2400, Perkin-Elrner) 
was programmed for an initial denaturing temperature of 94 °C for 4 min, a 30 sec 
denaturing temperature of 94 °C, an annealing temperature of 58 °C for 30 sec, and an 
extension temperature of 72 °C for 2 min. The reaction was carried out for 30 cycles. 
An additional extension at 72 °C followed for 5 min was allowed to proceed after 

20 completion of the final cycle. The PCR products were purified by 

phenol:chloroform:isoamyl alcohol extraction procedure and precipitated with 100 % 
ethanoL After resuspending in 50 pi of dH20, the amplified products were digested with 
BamHL The BamH\ flanked PCR products were used to replace the luciferase coding 
region of pAHC18 [Christensen et ai t 1996]. 

25 Insertion of the PCR-amplified DNA binding domain of 02 and PBF in antisense 

orientation into the expression cassette containing the ubiquitin promoter was 
completed by PCR using the primer sets M02/rv and NOS/rv; 5'- 
CGGCAACAGGATTCAATCT-3\ PBF/rv and NOS/rv. The PCR was performed under 
the conditions described above, with the exception that the annealing temperature was 

30 changed to 53 °C. 

B. Transient Assay using Rice Endosperm 

DNA coated gold particles were prepared by mixing 50 |jl of gold suspension (60 
mg/ml), 50 pi of CaCI 2 , 2.5 M and 20 pi of spermidine, 0.1 M. In all cases, 5 pg of the 
35 GUS chimeric construct and 5 pg of pAHC1 8 (containing the luciferase gene under the 
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control of the ubiquitin promoter), were used. For co~transfection with effector piasmids, 
effector piasmids were additionally added in the amount indicated. The total amount of 
piasmids used for coating gold particles remained constant by adding the pBluescriptll 
KS (+) or the PMF plasmid (containing the CaMV 35S promoter driving an expression 
cassette which lacks a coding region). After vortexing for 1 min, the gold particles were 
washed with 100 % ethanol twice and finally resuspended in 50 pi of 100 % ethanol. 10 
pi of gold particle suspension was loaded into a macrocarrier for bombardment 

Rice immature seeds at 7 to 9 DAP (days after pollination) were harvested and 
sterilized with husk by incubating 10 minutes in 70 % ethanol and followed by spraying 
with 100 % ethanol. After the ethanol evaporated completely, transient assay 
incubation buffer (TAIB: complete AA medium [Thompson, 1986] supplemented with 
NH4NO3 1.4 g/L, 100pg/ml of cefotaxime and 100 pg/ml of timentin ) was added to 
prevent seeds from drying out. A portion of each seed grain which contains the embryo 
(about one fifth) was cut off using a sharp blade and the immature endosperm was 
squeezed out. About 10 rice immature endosperms were placed in the center of a 3 
MM Whatman filter paper, pre wetted with TAIB. Particle bombardment was carried out 
using the biolistic helium gun device (Dupont PSD-1000), as described Hwang et al„ 
1998. After bombardment, the bombarded immature endosperms were incubated in 5 
ml of TAIB in the dark at 25 °C for 1 day. 

After incubation, immature endosperms were transferred to a conical Eppendorf 
tube and homogenized in 55 pi of extraction buffer (KH 2 P0 4 , 0.1 M, EDTA, 1 mM and p- 
mercaptoethanol, 7 mM) by a disposable plastic pestle. After spinning down the cell 
debris by centrifuging at 15,000 rpm for 15 min, 20 pi of supernatant was used in an 
assay for GUS or luciferase enzyme activity [deWet, 1 987; Bronstein, 1 994]. 

C. Transcriptional Activation with 02 and PBF 

Transient expression assays were carried out to evaluate the effector activity of 
the 02 and PBF transcription factors on GUS expression. 

Rice immature endosperms were isolated from caryopses at 7-9 DAP 
(DAP:Days After Pollination). The amount of each effector plasmid used is indicated in 
the individual examples, below. The total amount of plasmid for coating the gold 
particles remained constant by adding pBluescriptll KS(+). To normalize transfection 
efficiency, in every experiment, a luciferase gene under the control of the maize 
ubiquitin promoter was co-transfected (with 5 pg of pAHC18 (Ubi/LUC/NOS) used as an 
internal control for all experiments). Following measurements of GUS and LUX activity, 
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the GUS expression level was normalized by dividing by the LUX activity from luciferase 
to obtain the GUS/LUX ratio. Therefore, the GUS/LUX ratio quantitatively indicates the 
transcriptional activity from the promoter of the heterologous reporter gene. 

Developing rice endosperms at 7-9 DAP were bioiistically bombarded with 
5 various heterologous nucleic acid constructs to examine the ability of the maize trans 
acting factor genes encoding opaque 2 (02) and prolamin box binding factor (PBF) to 
effect expression of heterologous nucleic acid constructs under the control of the 
promoter regions from the rice glutelin genes Gt3, GluB-1 and GluB-2; the rice prolamin 
genes RP6 and PG5a; the rice globulin gene Gib; and the wheat glutelin gene, Bx7. 02 

1 0 and PBF were expressed under the control of the Ubi promoter in the Ubi:Q2 and 
Ubi:PBF constructs, respectively. 

As shown in Figure 1 3B, co-transfection of the Gf1 reporter plasmid 
(Gt1/GUS/NOS) with 35S.02 and 35S:PBF resulted in approximately a 5 and 3-fold in 
transcription activity, respectively, relative to the activity from the Gt1 reporter plasmid 

15 alone (Figure 13B). 

When immature endosperms were co-bombarded with an equimolar mixture of 
the 35S:02 and 35S:PBF constructs, GUS expression from Gt1/GUS/NOS increased up 
to 6-fold (Fig. 14B). Transient assays using effector plasmids expressing mutant forms 
of the 02 and PBF transcription factors demonstrated that the observed activation is a 

20 consequence of direct interaction between 02 and PBF and the Gt1 promoter. The 
relative GUS/LUX ratio from the Gt1 reporter plasmid was not affected by co- 
bombardment with effector plasmids expressing defective forms of the 02 and PBF 
proteins (Fig. 14B) indicating that the capability of 02 and PBF to bind their specific 
target sites is critical for transactivation of the Gt1 promoter. 

25 Figure 13B illustrates transactivation of the Gt1 promoter by various amounts of 

02 and PBF effector plasmids in rice immature endosperms. As shown in Fig 13B, 
Ubi:02 and UbhPBF at the amount of 1 pg were able to transactivate the Gt1 promoter, 
approximately 4 and 3 fold, respectively. This transactivation effect increased along 
with the increase in the amount of effector up to approximately 5 pg. The increase in 

30 promoter activity by 02 and PBF together was shown to be additive, independent of the 
amount of plasmid combined (Fig. 13B). 

As described above, in addition to the promoter of rice glutelin gene (Gf1), 
several other promoters of genes encoding different kinds of seed storage proteins like 
rice globulin, rice prolamin and wheat glutenin were tested for their responsiveness to 

35 02 and PBF (Fig. 12A). The transcription activity from the promoter of rice globulin, Gib 
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was increased by about 3 and 4 fold, in the presence of 02 and PBF, respectively (Fig. 
12B). Bx7, the glutelin promoter from wheat, was shown to be transactivated in rice 
immature endosperms by co-bombardment with 02 and PBF up to about 1 .8 fold, 
respectively (Fig. 12B). The 02 and PBF effectors were also shown to transactivate the 
5 rice prolamin gene promoters, RP6 about 5 and 3.5 fold, and PG5a % about 2 and 1.5 
fold, respectively (Fig. 12B). 

The effect of 02 and PBF on the promoters from different kinds of rice storage 
genes including Gt1, Gt3 t GluB-1 and GluB-2; the rice prolamin genes RP6 and PG5a\ 
the rice globulin gene Gib and the wheat glutelin gene, 8x7 was tested. Table 2 shows 

10 the responsiveness of the promoters to 02 and PBF, with an additive increase in 
transactivation observed by co-transfection with both effector plasmids. Of the 
promoters tested, the Gt1 promoter was the most responsive to both of 02 and PBF and 
co-transfection with both plasmids gave about a 15 fold increase in GUS activity. The 
additive increase in promoter activity by co-bombardment of both effector plasmids was 

15 observed in all the promoters of storage protein genes that were tested. The rice actin 
promoter was much less responsiveness to 02 and PBF effectors and the promoter 
activity of the CaMV 35S promoter was not affected by co-transfection with 02 and PBF 
effector plasmids (Table 13). 

Table 1 3 GUS expression in rice immature endosperms in response to Q2 and 

20 PBF. 



OEffector 


02 i 


PBF 1 


02/PBF 1 


Gt1 


10.8(2.14) 


4.57(1.55) 


14.6(1.08) 


Gib 


3.22(0.72) 


4.42(0.85) 


8.55(2.27) 


RP6 


5.22(1.45) 


3.36(0.71) 


8.20(1.18) 


Bx7 


1.78(0.13) 


1 .84(0.38) 


3.81(1.03) 


PG5a 


2.13(0.42) 


1.59(0.18) 


3.14(0.60) 


Actin 


1 .59(0.25) 


1.14(0.21) 


1.64(0.48) 


CaMV 35S 


1.18(0.04) 


1.06(0.18) 


1.34(0.12) 



1 fold activation was calculated by normalizing the GUS/LUX ratio from rice endosperms co-bombarded 
with each effector and a reporter construct to the GUS/LUX ratio of that effector alone. For example, for 
GT1, all results are given relative to the GUS/LUX ratio of Gt1/GUS/NOS construct. 
S.D. indicates the standard deviation of a mean value for at least five independent particle bombardments. 

1 fold activation was calculated by normalizing the GUS/LUX ratio from rice endosperms co-bombarded 
with each effector and a reporter construct to the GUS/LUX ratio of that effector alone. For example, for 
GT1 , all results are given relative to the GUS/LUX ratio of Gtt/GUS/NOS construct. 
S.D. indicates the standard deviation of a mean value for at least five independent particle bombardments. 

1 Results are presented as "fold activation(SD)\ calculated by normalizing the GUS/LUX ratio from rice 
endosperms co-bombarded with each effector and a reporter construct to the GUS/LUX ratio of that effector 
alone. For example, for GT1 , all results are given relative to the GUS/LUX ratio of Gt1/GUS/NOS construct. 
S.D. indicates the standard deviation of a mean value for at least five independent particle bombardments. 
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EXAMPLE 15 

Generation of Transgenic Plants Which Express Plant Transcription Factors 
Heterotogous nucleic acid constructs were prepared for generation of stable 
transgenic plant lines. Figs. 15A and B present a schematic depiction of exemplary 
5 plasmids for use in generating such stable transgenic plants. The plasmids contain a 
heterologous protein coding sequence for lactoferrin and lysozyme under the control of 
the rice endosperm-specific globulin promoter (Gib), as shown in Fig. 15A and Fig. 15B, 
respectively. 

10 A. Vector Constructs 

A gene encoding the mature polypeptide of human lysozyme (EC 3.2.1 .17) with 
a G+C content of 68.4 % was synthesized (Operon, Alameda, CA, USA) based on the 
sequence of GenBank Accession number J03801. The DNA was digested with 
Dral/Xhol, and ligated into the Nael/Xhol sites of the expression cassette in plasmid 

15 API241 which contains the rice globulin gene promoter and signal peptide (GenBank 
Accession Number X63990). The resulting plasmid was named pAPI264. 

B. Development of Stable Transformants and Transgenic Plants 
Microprojectile-mediated transformation of rice was carried out based on the 

20 procedure described in [Chen, 1998]. Calli were derived from the hypocotyls of 
germinating mature seeds of the cultivar TP309 (Oryza sativa, subspec. Japonica). 
Calli with a diameter of 2-4 mm were selected and placed on a N6 medium (Sigma, 
Louis, MO, USA) supplemented with 0.3M mannitol and 0.3M sorbitol for about 20 hours 
before bombardment. Bombardment was carried out with the biolistic PDC-1000/He 

25 instrument (Bio-Rad, Hercules, CA, USA). Fifty yd of gold particles (60mg,mT 1 at 1:1 
ratio of 1 .0 and 1 .5-3.0 jxm diameter gold particle) were coated with effector DNA, target 
DNA and selection marker DNA in a ratio of 3:3:1 (w/w) and accelerated with a helium 
pressure of 1 100 psi. After two day's incubation, calli were transferred to N6 selection 
media containing 35mg ; l" 1 hygromycin B and allowed to grow in the dark at 26°C for 45 

30 days. Calli resistant to hygromycin B were transferred to regeneration media and used 
to generate plantlets as described in [Chen, 1998]. After shoots had reached a height of 
1-3 cm, the plantlets were transferred to rooting media (MS plus 0.05 mg-P 1 a~ 
Naphthaleneactic acid, Sigma, Louis, MO, USA) and two weeks later the plantlets were 
transferred to soil and grown in the greenhouse to maturity. 
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C. PCR Analysis of Transgenic Plants 

Genomic DNA was prepared from samples of transgenic plant leaves as 
described in [Dellaporta, 1983 59] and used as the template for amplification with two 
pairs of primers for the identification of transgenes. For the rice Reb gene, the forward 
5 primer 5 -CCATCCAATCCAATCCACTCCAAC-3' (Fig. 16A) was designed based on a 
3' untranslated terminal sequence of the gene, and the reverse primer was designed 
based on the vector sequence S'-AGGCGATTAAGTTGGGTAACG-S' (Fig. 16A). For 
the human lysozyme transgene, the forward primer was designed based on the 5' end 
of the open reading frame of the gene S'-CCTAGCCAAAGTCTT CGAGCGGTG-3* (Fig. 

10 16B), and the reverse primer was designed based on the 3' end of the open reading 
frame of the gene 5'-GCGATGTTGTCTTGCAGC-3' (Fig. 16B). The PCR mixture 
contained 100 ng genomic DNA, 10 mM Tris (pH 8.3), 1.5 mM MgCI 2 , 50 mM KCI, and 
0.5 ^M dNTP. Amplification employed a program of denaturing at 94 °C for 5 min 
followed by 30 cycles of 94 °C for 30 seconds, 60 °C for 30 seconds and 72 °C for 45 

15 seconds. The PCR products were resolved by electrophoresis in a 1.2 % agarose gel. 

D. Lysozyme Activity Assay 

A lysozyme assay was carried out using a procedure where twenty individual 
seeds from each T1 transgenic plant were ground in 1 ml of pre-cold extraction buffer 

20 (PBS plus 0.35 M NaCI). After centrifugation at 25,000g for 5 min at 4°C, the 

supernatant was recovered. A series of dilutions were made and an aliquot was added 
to a 96-well microtiter plate containing 250 jul of 0.015 % Micrococcus letus cells in each 
well (Sigma, Louis, MO, USA; procedure developed at Applied Phytologics Inc.). 
Human lysozyme (EC 3.2. 1.17, Sigma, Louis, MO, USA) was used as the standard and 

25 lysozyme activity was measured based on the decrease in turbidity, evaluated using a 
Microplate Reader 3550 (Bio-Rad, Hercules, CA, USA). The lysozyme concentration in 
the samples was determined based on absorbance values of samples relative to a 
standard curve prepared using different concentrations of human lysozyme. The 
lysozyme expression level in a given transgenic plant was calculated as the average 

30 lysozyme content of the twenty seeds taken from that plant Total soluble protein in 
seed extracts was estimated using the Bradford protein assay (Bio-Rad, Hercules, CA, 
USA). 
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E. Enhanced Human Lvsozvme Expression in Transgenic Rice Seed Co- 
Transformed with Reb 

In order to evaluate the ability of Reb to increase the expression of a transgene 
in transgenic plants, plants cells were co-transformed with heterologous nucleic acid 
5 constructs comprising the human lysozyme gene driven by the Gib promoter (Glb-lys) 
and the Reb gene driven by the actin promoter (Act-Reb) and with the Act-Reb construct 
alone. Normal plant phenotypes were obtained among transformants containing both 
Glb-lys and native Reb and PCR analysis confirmed that both genes had integrated into 
the genome in 10 out of 11 plants (Fig. 17). Transgenic seeds were harvested at 

1 0 maturity and lysozyme activity was analyzed. 

Lysozyme expression in seeds of the 10 transgenic plants containing native-Reb 
and GIb-iys ranged from 31-133 M9* m 9~ 1 soluble protein with an average of 69.8±11.6 
pg-mg" 1 soluble protein (Fig. 17). Seeds taken from seventeen transgenic plants 
containing Glb-Lys alone expressed lysozyme in amounts ranging from 7 to 76 pg-mg" 1 

15 soluble protein with an average of 33.95±4.96 pg *mg' 1 soluble protein (Fig. 17). No 
lysozyme activity was detected in untransformed rice seeds (Fig. 17). The results 
showed that the expression level of the lysozyme increased on an average 2-fold, when 
the seeds were transgenic for both the Reb gene and Glb-lys. The statistical analysis (t- 
test) showed that the amount of lysozyme in seeds from the plants transgenic for the 

20 Reb gene and Glb-lys is significantly higher than in the plants with Glb-lys alone 
(P<0.001). 
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Description 


SEQ 

ID 

NO 


Codon optimized lysozyme coding sequence: 

AAAGTCTTCGAGCGGTGCGAGCTGGCCCGCACGCTCAAGCGGCTCGGCAT 
GGACGGCTACCGGGGCATCAGCCTCGCCAACTGGATGTGCCTCGCCAAGT 
GGGAGTCGGGCTACAACACCCGCGCAACCAACTACAACGCCGGCGACCGC 
TCCACCGACTACGGCATCTTCCAGATCAACTCCCGCTACTGGTGCAACGAC 
GGCAAGACGCCCGGGGCCGTCAACGCCTGCCACCTCTCCTGCTCGGCCCT 
GCTGCAAGACAACATCGCCGACGCCGTCGCGTGCGCGAAGCGGGTCGTCC 
GCGACCCGCAGGGCATCCGGGCCTGGGTGGCCTGGCGCAACCGCTGCCA 
GAACCGGGACGTGCGCCAGTACGTCCAGGGCTGCGGCGTCTGA 


1 


Amino acid sequence based on codon optimized lysozyme coding sequence: 
KVFERCELARTLKRLGMDGYRGiSLANWMCLAKWESGYNTRATNYNAGDRST 
DYGIFQINSRYWCNDGKTPGAVNACHLSCSALLQDNIADAVACAKRWRDPQGI 
RAW VAW RN RCQN RD VRQ YVQG CG V 


2 


Codon optimized lactoferrin coding 

sequence:GGGCGGCGGCGGCGCTCGGTGCAGTGGTGCGCCGTGTCCCAGC 

CCGAGGCGACCAAGTGCTTCCAGTGGCAGCGCAACATGCGGAAGGTGCGC 

GGCCCGCCGGTCAGCTGCATCAAGCGGGACTCCCCCATCCAATGCATCCAG 

GCCATCGCGGAGAACCGGGCCGACGCGGTCACCCTGGACGGCGGGTTCAT 

CTACGAGGCGGGGCTCGCCCCGTACAAGCTCCGCCCGGTGGCGGCGGAG 

GTGTACGGCACCGAGCGCCAGCCGCGCACGCACTACTACGCGGTGGCCGT 

CGTCAAGAAGGGCGGGTCGTTCCAGCTCAACGAGCTGCAGGGCCTGAAGT 

CGTGCCACACGGGCCTCCGGCGGACGGCGGGCTGGAACGTGCCCATCGG 

CACCCTGCGCCCCTTCCTGAACTGGACCGGCCCGCCGGAGCCGATCGAGG 

CCGCCGTGGCCCGCTTCTTCAGCGCCTCCTGCGTCCCCGGCGCCGACAAG 

GGCCAGTTCCCGAACCTCTGCCGGCTCTGCGCCGGGACGGGCGAGAACAA 

GTGCGCCTTCTCCTCGCAGGAGCCGTACTTCTCCTACTCGGGCGCGTTCAA 

GTGCCTCCGCGACGGGGCCGGCGACGTGGCGTTCATCCGCGAGTGCACCG 

TGTTCGAGGACCTCTCCGACGAGGCGGAGCGGGACGAGTACGAGCTGCTG 

TGCCCCGACAACACCCGCAAGCCGGTGGACAAGTTCAAGGACTGCCACCTG 

GCGCGGGTGCCCTCGCAGGCGGTCGTCGCCCGCAGCGTCAACGGCAAGGA 

GGACGCGATCTGGAACCTCCTCCGCCAGGCCCAGGAGAAGTTCGGCAAGG 

ACAAGTCCCCCAAGTTCCAGCTCTTCGGGAGCCCCAGCGGCCAGAAGGACC 

TCCTCTTCAAGGACTCCGGGATCGGCTTCTCCCGCGTCCCCCCGCGCATCG 

ACTCCGGCCTGTACCTCGGCTCCGGGTACTTCACCGCGATCCAGAACCTCC 

GGAAGAGCGAGGAGGAGGTGGCGGCGGGGCGGGCCCGCGTCGTGTGGTG 

CGCCGTGGGCGAGCAGGAGCTGCGGAAGTGCAACGAGTGGAGCGGCCTGA 

GCGAGGGGTCGGTGACCTGCTCGTCCGCCAGCACCACCGAGGACTGCATC 

GCGCTCGTCCTCAAGGGGGAGGCCGACGCGATGAGCCTCGACGGGGGGTA 

CGTCTACACCGCCGGCAAGTGCGGCCTGGTCCCGGTCCTGGCGGAGAACT 

ACAAGTCGCAGCAGTCCAGCGACCCCGACCCGAACTGCGTGGACCGCCCC 

GTCGAGGGCTACCTCGCCGTGGCCGTCGTGCGCCGGTCCGACACCTCCCT 

GACGTGGAACAGCGTCAAGGGCAAGAAGAGCTGCCACACCGCCGTGGACC 

GCACCGCCGGCTGGAACATCCCGATGGGCCTCCTCTTCAACCAGACCGGCT 

CCTGCAAGTTCGACGAGTACTTCTCCCAGTCCTGCGCCCCCGGCTCGGACC 

CCCGCTCCAACCTGTGCGCCCTCTGCATCGGGGACGAGCAGGGCGAGAAC 

AAGTGCGTGCCCAACAGCAACGAGCGGTACTACGGCTACACGGGGGCCTT 

CCGCTGCCTGGCGGAGAACGCGGGGGACGTCGCGTTCGTGAAGGACGTGA 

CCGTGCTGGAAAACACGGACGGGAACAACAACGAGGCGTGGGCGAAGGAC 

CTCAAGCTCGCCGACTTCGCCCTGCTGTGCGTCGACGGCAAGCGCAAGCCC 

GTCACCGAGGCGCGGTCCTGCCACCTGGCGATGGCCCCCAACCACGCCGT 

CGTCTCCCGCATGGACAAGGTCGAGCGCCTCAAGCAGGTGCTCCTGCACCA 

GCAGGCCAAGTTCGGCCGGAACGGCAGCGACTGCCCGGACAAGTTCTGCC 


3 
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TGTTCCAGTCGGAGACCAAGAACCTCCTCTTCAACGACAACACCGAGTGCCT 
GGCGCGCCTCCACGGCAAGACCACCTACGAGAAGTACCTCGGCCCGCAGT 
ACGTCGCCGGCATCACCAACCTCAAGAAGTGCTCCACCTCCCCCCTCCTGG 
AGGCGTGCGAGTTCCTCCGCAAGTGA 




Amino acid sequence based on codon optimized lactoferrin coding sequence: 

GRRRRSVQWCAVSQPEATKCFQWQRNMRKVRGPPVSGIKRDSPIQCIQAIAEN 

RADAVTLDGGFtYEAGLAPYKLRPVAAEVYGTERQPRTHYYAVAWKKGGSFQL 

NELQGLKSCHTGLRRTAGWNVPIGTLRPFLNWTGPPEPIEAAVARFFSASCVPG 

ADKGQFPNLCRLCAGTGENKCAFSSQEPYFSYSGAFKCLRDGAGDVAFIREST 

VFEDLSDEAERDEYELLCPDNTRKPVDKFKDCHLARVPSHAWARSVNGKEDAI 

WNLLRQAQEKFGKDKSPKFQLFGSPSGQKDLLFKDSAIGFSRVPPRIDSGLYLG 

SGYFTAIQNLRKSEEEVAARRARWWCAVGEQELRKCNQWSGLSEGSVTCSS 

ASTTEDCIALVLKGEADAMSLDGGYNATAGKCGLVPVLAENYKSQQSSDPDPN 

CVDRPVEGYLAVAWRRSDTSLTWNSVKGKKSCHTAVDRTAGWNIPMGLLFNQ 

TGSCKFDEYFSQSCAPGSDPRSNLCALCIGDEQGENKCVPNSNERYYGYTGAF 

RCLAENAGDVAFVKDVTVLQNTDGNNNEAWAKDLKLADFALLCLDGKRKPVTE 

ARSCHLAMAPNHAWSRMDKVERLKQVLLHQQAKFGRNGSDCPDKFCLFQSE 

TKNLLFNDNTECLARLHGKTTYEKYLGPQYVAGITNLKKCSTSPLLEACEFLRK 


4 


MV-Gt1-F1 primer: 

5' ATC GAA GCT TCA TGA GTA ATG TGT GAG CAT TAT GGG ACC ACG 3* 


5 


Xba«Gt1-R1 primer: 5' CTA GTC TAG ACT CGA GCC ATG GGG CCG GCT 
AGG GAG CCA TCG CAC AAG AGG AA 3* 


6 


Codon optimized lactoferricin coding sequence 

ACCAAGTGCTTCCAGTGGCAGCGCAACATGCGGAAGGTGCGCGGCCCGCC 
GGTCAGCTGCATCAAGCGGGAC 


7 


Codon optimized EGF coding sequence 

AACTCCGACTCGGAGTGCCCCCTCTCCCACGACGGTTACTGCCTCCACGAC 
GGGGTCTGCATGTACATCGAGGCCCTCGACAAGTACGCCTGCAACTGCGTC 
GTGGGCTACATCGGCGAGCGGTGCCAGTACCGCGACCTCAAGTGGTGGGA 
GCTGCGCTGA 


8 


Codon optimized IGF-1 coding sequence 

GGCCCGGAGACCCTCTGCGGCGCCGAGCTCGTGGACGCCCTCCAGTTCGT 

GTGCGGCGACCGCGGCTTCTACTTCAACAAGCCGACCGGCTACGGCAGCA 

GCAGCCGCCGCGCCCCGCAGACCGGCATCGTGGACGAGTGCTGCTTCCGC 

AGCTGCGACCTCCGCCGCCTGGAGATGTACTGCGCCCCGCTCAAGCCCGC 

CAAGAGCGCCTGA 


9 


Codon optimized lactohedrin coding sequence 

CTGGACATCTGCTCGAAGAACCCGTGCCACAACGGCGGGCTCTGCGAGGA 

GATCAGCCAGGAGGTGCGGGGCGACGTGTTCCCCTCGTACACCTGCACCT 

GCCTGAAGGGCTACGCCGGGAACCACTGCGAGACGAAGTGCGTGGAGCCC 

CTGGGGATGGAGAACGGCAACATCGCCAACTCCCAGATCGCCGCCTCCTCC 

GTGCGGGTGACCTTCCTCGGCCTCCAGCACTGGGTCCCGGAGCTGGCCCG 

GCTCAACCGGGCGGGCATGGTGAACGCGTGGACCCCCTCGTCCAACGACG 

ACAACCCGTGGATCCAAGTGAACCTGCTCCGCCGCATGTGGGTCACCGGCG 

TGGTCACCCAAGGCGCCAGCCGCCTGGCCAGCCACGAGTACCTCAAGGCC 

TTCAAGGTCGCCTACAGCCTCAACGGCCACGAGTTCGACTTCATCCACGAC 

GTCAACAAGAAGCACAAGGAGTTCGTGGGCAACTGGAACAAGAACGCGGTC 

CACGTGAACCTCTTCGAGACCCCCGTCGAGGCCCAGTACGTCCGCCTCTAC 

CCCACGAGCTGCCACACCGCCTGCACGCTCCGCTTCGAGCTGCTGGGGTG 

CGAGCTGAACGGGTGCGCGAACCCGCTGGGGCTCAAGAACAACAGCATCC 

CCGACAAGCAGATCACGGCCTCGTCGTCGTACAAGACCTGGGGCCTGCACC 

TCTTCTCGTGGAACCCGAGCTACGCCCGGCTGGACAAGCAGGGCAACTTCA 

ACGCCTGGGTCGCCGGGAGCTACGGGAACGACCAGTGGCTCCAGGTGGAC 

CTCGGCAGCTCCAAGGAGGTCACCGGCATCATCACGCAGGGGGCCCGCAA 

CTTCGGCTCCGTGCAGTTCGTGGCCTCCTACAAGGTGGCCTACTCGAACGA 


10 
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CAGGGGCAACTGGACCGAGTACCAGGACCCGCGGACCGGGTCCAGCAAGA 
TCTTCCCCGGCAACTGGGACAACCACAGCCACAAGAAGAACCTGTTCGAGA 
CCCCeATCCTCGCCCGGTACGTCCGCATCCTCCCCGTCGCTTGGCACAACC 
GGATCGCGCTCCGGCTGGAGCTCCTCGGCTGCTGA 




Codon optimized kappa-casein coding sequence 

GAGGTCCAAAACCAGAAGCAGCCCGCCTGCCACGAGAACGACGAGCGCCC 

GTTCTACCAGAAGACCGCACCCTACGTCCCGATGTACTACGTCCCGAACAG 

CTACCCCTAGTACGGTAGGAACGTGTACGAGCGGCGCGCGGCCATCGGTAT 

CAACAAGGCCTAGGTCGCCCGGACCTACTACGCGAACGCGGCCGTGGTGC 

GGCGGCACGGGCAGATCGCGCAGCGGCAGTAGCTGCGAAACAGCCAGGGG 

CCCACGGTGGTGCGGGGGGGCAACGTCCACGCGAGCTTCATGGGTATGGCC 

CCCAAGAAGATCGAGGAGAAGATCATGATCCCGACGATGAACAGGATCGCGA 

CCGTGGAGGGGACGCCAGCGCCCGGGACCGAGGGGACGGTGGACAGGGT 

GGTGACGCCAGAGGCGTTCTCGGAATGGATGATCAGCTGGACCGGCGAGAC 

CAGCACGGTGGGGGTGAGGCGGCGGACGGGATGA 


11 


Godon optimized haptocorrin coding sequence 

GAGATCTGGGAGGTGTCGGAGGAGAAGTAGATGCGGGTCAAGGCCGTGCTG 

AAGAGGATGATCCAGAGCAACTACAACCGGGGCACGTCGGGCGTGAACGTC 

GTGCTCTCGCTGAAGCTGGTGGGCATCCAGATCGAGACGCTGATGCAGAAG 

ATGATGGAGCAGATCAAGTAGAACGTGAAGAGCCGGCTCTCGGACGTGTCC 

AGGGGCGAGGTGGCGGTGATCATCGTCGCGCTCGGCGTGTGCCGGAAGGC 

GGAGGAGAAGGTCATCTAGGACTACCACCTCACGGACAAGCTGGAGAACAA 

GTTGGAGGGCGAGATGGAGAACATGGAGGGCCAGAAGGGGAGCGGGGTGA 

CCAAGTAGTACCAGCTCAGCCTGGAGGTCCTCGCGCTCTGGCTGTTGAACG 

GGAAGTAGTGGACCGGGGAGGTGGTCAACCACTTGAGCCCCGAGAACAAGA 

ACTACTAGTTGGGGTCGGAGTTCTCCGTGGAGACCGGGGGGATGGGCGTCG 

TGGCCCTCACCTGCGTGAAGAAGTGGGTCATGAAGGGCCAGATGAAGGCCG 

ACC3AGGGCTCCGTGAAGAAGATCTCGATCTACAGGAAGAGCGTCGTGGAGA 

AGATCCTGAGCGAGAAGAAGGAGAACGGGGTGATGGGGAAGACGTTCTCGA 

GGGGGGAGGGGATGCAGGCGCTGTTGGTGAGCAGCGACTACTAGAAGGAG 

AACGACTGGAACTGGGAGGAGAGGCTGAACAGGGTCCTGACCGAGATCAGC 

CAGGGGGCGTTGAGCAACGCCAACGCCGCCGCCCAGGTCCTGCGGGGGCT 

GATGGGCAAGACCTTCGTCGACATCAACAAGGAGAGGTGCTGCGTGTCGGG 

GAGCGGCAAGTTGAACATCTCCGCCGAGGAGCGGATCACGGTGACGCCGC 

CCGACAGCCAGTCGTACATCTCCGTGAACTACAGCGTGGGGATGAACGAGA 

CGTACTTCAGGAACGTGAGGGTCCTCAAGGGCTCGGTCTTGCTGAGCGTGA 

TGGAGAAGGCGCAGAAGATGAACGACACGATCTTCGGCTTCACGATGGAGG 

AGCGCAGCTGGGGCCCCTACATCACCTGCATCCAGGGCCTCTGCGCCAACA 

ACAACGACCGCACCTACTGC5GAGCTGCTGAGCGGCGGCGAGCCGCTGAGC 

CAGGGGGCCGGCAGCTACGTGGTCCGCAACGGCGAGAACCTGGAGGTGCG 

GTGGAGGAAGTACTGA 


12 


Codon optimized lactoperoxidase coding sequence 

CAAAGGAGCCGGACGTCGGCGATGTCCGACACGGTCTCGCAGGCCAAGGT 

GCAAGTCAACAAGGCATTCCTGGATTCGCGCACGCGGCTGAAGACCGCGAT 

GTCGTCCGAGACCCCGACGAGCCGGCAGCTGAGCGAGTACCTCAAGCACG 

CGAAGGGGCGGACGCGCACCGCCATCCGCAATGGCCAAGTGTGGGAGGAA 

TCCCTGAAGCGGCTGCGGCAGAAGGCGTCGCTCACCAACGTGACGGACCC 

GTCCCTCGACCTGACCAGCCTCTCCCTGGAGGTCGGCTGCGGCGCCCCGG 

CGCCCGTCGTGCGCTGCGACCCCTGCTCGCCATACCGCACGATCACGGGC 

GACTGCAACAACCGGCGGAAGCCGGCACTGGGGGCTGCGAACCGCGCCCT 

CGCGCGCTGGCTCCCCGCCGAGTACGAGGACGGCCTCAGCCTCCCCTTCG 

GTTGGAGCCCCGGCAAGACGCGCAACGGCT 1 UOUCjO 1 OUOUCT CCsC 1 GvjO 

GAGGTCAGCAACAAGATCGTCGGTTACCTGAACGAGGAGGGGGTCCTCGAC 

CAAAACCGCTCGCTCCTCTTCATGCAGTGGGGGCAGATCGTGGACCACGAC 

CTGGACTTCGCCCCGGACAGGGAGCTGGGCTCCAGCGAGTACAGCAAGAC 

CCAGTGCGACGAATACTGCATCCAGGGCGACAACTGCTTCCCGATCATGTT 

CCCCCCGAACGACCCGAAGGCGGGCACCCAGGGCAAGTGCATGCCGTTCT 

TCCGGGCAGGCTTCGTCTGCCCGACCCCCCCGTACAAGTCCCTCGCGCGC 
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GAGCAGATCAACGCGCTCACGTCCTTCCTCGACGCCAGCTTCGTCTACAGC 

AGCGAGCCGTCCCTCGCCAGCCGCCTCCGCAAGCTCAGCAGCCCCCTCGG 

CCTCATGGCGGTCAACCAGGAGGTGTCGGACCACGGCCTCCCATACCTGCC 

GTACGACAGCAAGAAGCCGTCCCCCTGCGAGTTCATCAACACCACCGCGCG 

CGTCCCGTGCTTCCTCGCCGGCGATTCGCGGGCGAGCGAGCACATCCTGC 

TCGCCACGAGCCACACCCTGTTCCTCCGCGAGCACAACCGCCTCGCCCGG 

GAGCTGAAGCGCCTCAACCCGCAGTGGGACGGCGAGAAGCTCTACCAGGA 

GGCCCGGAAGATCCTCGGCGCTTTCGTCCAGATCATCACCTTCCGGGACTA 

CCTCGCCATCCTGCTCGGTG ACCAC ATGCAG AAGTGG ATCCCCCCCTACCA 

AGGCTACTCCGAGAGCGTGGACGCGCGCATCTCCAACGTCTTCACGTTCGC 

GTTCCGCTTCGGGCACCTGGAGGTGCCGTCGTCGATGTTCCGCCTCGACGA 

GAACTACCAGCCCTGGGGCCCAGAGCCGGAGCTGCCGCTCCACACCCTGT 

TCTTCAACACCTGGCGGATGGTCAAGGACGGCGGCATCGACCCGCTCGTGC 

GGGGGCTCCTGGCTAAGAAGTCGAAGCTCATGAAGCAGAACAAGATGATGA 

CCGGCGAGCTGCGCAACAAGCTGTTCCAGCCCACCCACCGCATCCACGGG 

TTCGACCTGGCTGCAATCAACACCCAGCGGTGCCGCGACCACGGCCAGCC .> 

CGGCTACAACTCGTGGCGCGCGTTCTGCGACCTCTCCCAGCCACAGACGCT 

GGAGGAGCTCAACACCGTGCTCAAGAGCAAGATGCTCGCCAAGAAGCTGCT 

CGGGCTCTACGGCACGCCCGACAACATCGACATCTGGATCGGGGCCATCG 

CGGAGCCGCTCGTGGAGCGCGGGCGCGTCGGCCCGCTGCTCGCGTGCCT 

CCTGGGCAAGCAATTCCAACAGATCCGCGACGGGGACCGGTTCTGGTGGG 

AGAACCCCGGCGTGTTCACCAACGAGCAGAAGGATTCGCTCCAAAAGATGA 

GCTTCTCCCGCCTGGTGTGCGACAACACCCGCATCACCAAGGTCCCGCGCG 

ACCCATTCTGGGCCAACTCCTACCCGTACGACTTCGTGGACTGCTCCGCCA 

TCGACAAGCTCGACCTGTCCCCCTGGGCATGGGTGAAGAACTGA 




Codon optimized alpha-1 -antitrypsin coding sequence 

GAGGACCCGCAGGGCGACGCCGCCCAGAAGACCGACACCAGCCACCACGA 

CCAGGACCACCCGACGTTCAACAAGATCACCCCGAATTTGGCCGAATTCGC 

CTTCAGCCTGTACCGCCAGCTCGCGCACCAGTCCAACTCCACCAACATCTTC 

TTCAGCCCGGTGAGCATCGCCACCGCCTTCGCCATGCTGTCCCTGGGTACC 

AAGGCGGACACCCACGACGAGATCCTCGAAGGGCTGAACTTCAACCTGACG 

GAGATCCCGGAGGCGCAGATCCACGAGGGCTTCCAGGAGCTGCTCAGGAC 

GCTCAACCAGCCGGACTCCCAGCTCCAGCTCACCACCGGCAACGGGCTCTT 

CCTGTCCGAGGGCCTCAAGCTCGTCGATAAGTTCCTGGAGGACGTGAAGAA 

GCTCTACCACTCCGAGGCGTTCACCGTCAACTTCGGGGACACCGAGGAGGC 

CAAGAAGCAGATCAACGACTACGTCGAGAAGGGGACCCAGGGCAAGATCGT 

GGACCTGGTCAAGGAATTGGACAGGGACACCGTCTTCGCGCTCGTCAACTA 

CATCTTCTTCAAGGGCAAGTGGGAGCGCCCGTTCGAGGTGAAGGACACCGA 

GGAGGAGGACTTCCACGTCGACCAGGTCACCACCGTCAAGGTCCCGATGAT 

GAAGAGGCTCGGCATGTTCAACATCCAGCACTGCAAGAAGCTCTCCAGCTG 

GGTGCTCCTCATGAAGTACCTGGGGAACGCCACCGCCATCTTCTTCCTGCC 

GGACGAGGGCAAGCTCCAGCACCTGGAGAACGAGCTGACGCACGACATCA 

TCACGAAGTTCCTGGAGAACGAGGACAGGCGCTCCGCTAGCCTCCACCTCC 

CGAAGCTGAGCATCACCGGCACGTACGACCTGAAGAGCGTGCTGGGCCAG 

CTGGGCATCACGAAGGTCTTCAGCAACGGCGCGGACCTCTCCGGCGTGAC 

GGAGGAGGCCCCCCTGAAGCTCTCCAAGGCCGTGCACAAGGCGGTGCTCA 

CGATCGACGAGAAGGGGACGGAAGCTGCCGGGGCCATGTTCCTGGAGGCC 

ATCCCCGTGTCCATCCCGGCCGAGGTCAAGTTCAACAAGCCCTTCGTCTTCC 

TGATGATCGAGCAGAACACGAAGAGCCCCCTCTTCATGGGGAAGGTCGTCA 

ACCCCACGCAGAAGTGA 

Codon optimized immunoglobulin-A coding sequence 


14 


Rice Gt1 promoter and Gt1 leader coding sequence 

CATGAGTAATGTGTGAGCATTATGGGACCACGAAATAAAAAGAACATTTTGAT 

GAGTCGTGTATCCTCGATGAGCCTCAAAAGTTCTCTCACCCCGGATAAGAAA 

CCCTTAAGCAATGTGCAAAGTTTGCATTCTCCACTGACATAATGCAAAATAAG 

ATATCATCGATGACATAGCAACTCATGCATCATATCATGCCTCTCTCAACCTA 

TTCATTCCTACTCATCTACATAAGTATCTTCAGCTAAATGTTAGAACATAAACC 


15 



119 



NSDOCID: <WO 02064750A2 I > 



WO 02/064750 



PCT/US02/04909 



CATAAGTCACGTTTGATGAGTATTAGGCGTGACACATGACAAATCACAGACT 

CAAGCAAGATAAAGCAAAATGATGTGTACATAAAACTCCAGAGCTATATGTCA 

TATTGCAAAAAGAGGAGAGCTTATAAGACAAGGCATGACTCACAAAAATTCA 

CTTGCCTTTCGTGTCAAAAAGAGGAGGGCTTTACATTATCCATGTCATATTGC 

AAAAGAAAGAGAGAAAGAACAACACAATGCTGCGTCAATTATACATATCTGTA 

TGTCCATCATTATTCATCCACCTTTCGTGTACCACACTTCATATATCATAAGA 

GTCACTTCACGTCTGGACATTAACAAACTCTATCTTAACATTTAGATGCAAGA 

GCCTTTATCTCACTATAAATGCACGATGATTTCTCATTG 1 1 1 CTCACAAAAAG 

CGGCCGCTTCATTAGTCCTACAACAACATGGCATCCATAAATCGCCCCATAG 

1 n TCTTCACAGTTTGCTTGTTCCTCTTGTGCGATGGCTCCCTAGCC 

Codon optimized alpha-1 -antitrypsin coding sequence 




Rice Gib promoter and Gt1 leader coding sequence 

CTGCAGGGAGGAGAGGGGAGAGATGGTGAGAGAGGAGGAAGAAGAGGAG 

GGGTGACAATGATATGTGGGGCATGTGGGCACCCAATTTTTTAATTCATTCT 

TTTGTTGAAACTGACATGTGGGTCCCATGAGATTTATTATTTTTCGGATCGAA 

TCGCCACGTAAGCGCTACGTCAATGCTACGTCAGATGAAGACCGAGTCAAAT 

TAGCCACGTAAGCGCCACGTCAGCCAAAACCACCATCCAAACCGCCGAGGG 

ACCTCATCTGCACTGG I I I I GA I AG I IGAGGGACCCGI I G I A I C I GG I I I I I C 

GATTGAAGGACGAAAATCAAA 1 1 1 GTTGACAAGTTAAGGGACCTTAAATGAA 

CTTATTCCATTTCAAAATATTCTGTGAGCCATATATACCGTGGGCTTCCAATC 

CTCCTCAAATTAAAGGGCC 1 1 1 1 1 AAAATAG ATAATTGCCTTCTTTCAGTCAC 

CCATAAAAGTACAAAACTACTACCAACAAGCAACATGCGCAGTTACACACATT 

TTCTGCACATTTCCGCCACGTCACAAAGAGCTAAGAGTTATCCCTAGGACAA 

TCTCATTAGTGTAGATACATCCATTAATC 1 1 1 1 ATCAGAGGCAAACGTAAAGC 

CGCTCTTTATGACAAAAATAGGTGACACAAAAGTGTTATCTGCCACATACATA 

ACTTCAGAAATTACCCAACACCAAGAGAAAAATAAAAAAAAATCTTTTTGCAA 

GCTCCAAATCTTGGAAACC 1 1 1 1 1 CACTCTTTGCAGCATTGTACTCTTGCTCT 

TTTTCCAACCGATCCATGTCACCCTCAAGCTTCTACTTGATCTACACGAAGCT 

CACCGTGCACACAACCATGGCGACAAAAACCCTATAAAACCCCATCCGATCG 

CCATCATCTCATCATCAGTTCATTACCAACAAACAAAAGAGGAAAAAAAACAT 

ATACACTTCTAGTGATTGTCTGATTGATCATCAATCTAGAGGCGGCCGCATG 

GCTAGCAAGGTCGTCTTCTTCGCGGCGGCGCTCATGGCGGCCATGGTGGC 

CATCTCCGGC 

Rice Gt1 promoter and Gt1 leader coding sequence 


16 


Other monocot maturation specific promoter sequences 
Seq#17 Bx7 promoter seq 

CTGCAGGCCAGGGAAAGACAATGGACATGCAAAGAGGTAGGGGCAGGGAA 

GAAACACTTGGAGATCATAGAAGAACATAAGAGGTTAAACATAGGAGGGCAT 

AATGGACAATTAAATCTACATTAATTGAACTCATTTGGGAAGTAAACAAAATC 

CATATTCTGGTGTAAATCAAACTA I I I GACGCGGATTTACTAAGATCCTATGT 

TAAI'TITAGACATGACTGGCCAAAGGTTTCAGTTAGTTCATTTGTCACGGAAA 

GGTG I I I I CA I AAG I CCAAAAC I CI ACCAAC TTTTTTGCACGTCATAGCATAG 

ATAGATGTTGTGAGTCATTGGATAGATATTGTGAGTCAGCATGGATTTGTGTT 

GCCTGGAAATCCAACTAAATGACAAGCAACAAAACCTGAAATGGGCTTTAGG 

AGAGATGGTTTATCAATTTACATGTTCCATGCAGGCTACCTTCCACTACTCGA 

CATGGTTAGAAGTTTTGAGTGCCGCATATTTGCGGAAGCAATGGCACTACTC 

GACATGGTTAGAAG I I I I GAGTGCCGCATATTTGCGGAAGCAATGGCTAACA 

GATACATATTCTGCCAAACCCCAAGAAGGATAATCACTCCTCTTAGATAAAAA 

GAACAGACCAATGTACAAACATCCACACTTCTGCAAACAATACACCAGAACT 

AGGATTAAGCCCATTACGTGGCTTTAGCAGACCGTCCAAAAATCTGTTTTGC 

AAGCACCAATTGCTCCTTACTTATCCAGCTTC I I I I GTGTTGGCAAACTGCCC 

Tl I ICCAACUGAI H IGH I CTTCTC ACGCTTTCTTC ATAGGCTAAACTAACCT 

CGGCGTGCACACAACCATGTCCTGAACCTTCACCTCGTCCCTATAAAAGCCC 

ATCCAACCTTACAATCTCATCATCACCCACAACACCGAGCACCCCAATCTAC 

AGATCAATTCACTGACAGTTCACTGATCTAGA 


17- 
23 



120 



BNSDOCiD: <WO 02064750A2J_> 



WO 02/064750 PCT/US02/04909 



Seq#18 Glub-2 promoter seq 

CTGCAGTAATGGATACCTAGTAGCAAGCTAGCTTAAACAAATCTAAATTCCAA 
TCTGTTCGTAAACGTTTTCTCGATCGCAATTTTGATCAAAACTATTGAAAACC 
TCAATTAAACCATTCAAAATTTTTAATATACCCAACAAGAGCGTCCAAACCAA 
ATATGTAAATATGGATGTCATGATAATTGACTTATGACAATGTGATTATTTCAT 
CAAGTCTTTAAATCATTAATTCTAGTTGAAGGTTTATGTTTTCTTATGCTAAAG 
GGTTATGTTTATATAAGAATATTAAAGAGCAAATTGCAATAGATCAACACAAC 
AAATTTGAATGTTTCCAGATGTGTAAAAATATCCAAATTAATTGTTTTAAAATA 
GTTTTAAGAAGGATCTGATATGCAAGTTTGATAGTTAGTAAACTGCAAAAGGG 
CTTATTACATGGAAAATTCCTTATTGAATATGTTTCATTGACTGGTTTATTTTA 
CATGACAACAAAGTTACTAGTATGTCAATAAAAAAATACAAGGTTACTTGTCA 
ATTGTATTGTGCCAAGTAAAGATGACAACAAACATACAAATTTATTTGTTCTTT 
TATAGAAACACCTAACTTATCAAGGATAGTTGGCCACGCAAAAATGACAACAT 
ACTTTACAATTGTATCATCATAAAGATCTTATCAAGTATAAGAACTTTATGGTG 
ACATAAAAAATAATCACAAGGGCAAGACACATACTAAAAGTATGGACAGAAAT 
TTCTTAACAAACTCCATTTGTTTTGTATCCAAAAGCATAAGAAATGAGTCATG 
GCTGAGTCATGATATGTAGTTCAATCTTGCAAAATTGCC I I I I I GTTAAGTATT 
GTTTTAACACTACAAGTCACATATTGTCTATACTTGCAACAAACACTATTACC 
GTGTATCCCAAGTGGCCTTTTCATTGCTATATAAACTAGCTTGATCGGTCTTT 
CAACTCACATCAATTAGCTTAAGTTTCCATTAGCAACTGCTAATAGCT 

Seq#19 Gt3 promoter seq 

CTGCAGTGTAAGTGTAGCTTCTTATAGCTTAGTGCTTTACTATCTTCACAAGC 
ACATGCTATAGTATTGTTCCAAGATGAAAGAATAATTCATCCTTGCTACCAAC 
TTGCATGATATTATATTTGTGAATATCCTATCTCTTGGCTTATAATGAAATGTG 
CTGCTGGGTTATTCTGACCATGGTATTTGAGAGCCTTTGTATAGCTGAAACC 
AACGTATATCGAGCATGGAACAGAGAACAAAATGCAAGGAI I I I I 1'IATTCTG 
GTTCATGCCCTGGATGGGTTAATATCGTGATCATCAAAAAAGATATGCATAAA 
ATTAAAGTAATAAATTTGCTCATAAGAAACGAAAACCAAAAGCACATATGTCC 
TAAACAAACTGCATTTTGTTTGTCATGTAGCAATACAAGAGATAATATATGAC 
GTGGTTATGACTTATTCACTTTTTGTGACTCCAAAATGTAGTAGGTCTAACTG 
ATTGTTTAAAGTGATGTCTTACTGTAGAAGTTTCATCCCAAAAGCAATCACTA 
AAGCAACACACACGTATAGTCCACCTTCACGTAATTCTTTGTGGAAGATAACA 
AGAAGGCTCACTGAAAAATAAAAGCAAAGAAAAGGATATCAAACAGACCATT 
| GTGCATCCCATTGATCCTTGTATGTCTATTTATCTATCCTCCTTTTGTGTACCT 
TACTTCTATCTAGTGAGTCACTTCATATGTGGACATTAACAAACTCTATCTTAA 
CATCTAGTCGATCACTACTTTACTTCACTATAAAAGGACCAACATATATCATC 
CATTTCTCACAAAAGCATTGAGTTCAGTCCCACAAAATCTAGA 

Seq #20 Glub-1 promoter seq 

CTGCAGAGATATGGATTTTCTAAGATTAATTGATTCTCTGTCTAAAGAAAAAA 

AGTATTATTGAATTAAATGGAAAAAGAAAAAGGAAAAAGGGGATGGCTTCTG 

CTTTTTGGGCTGAAGGCGGCGTGTGGCCAGCGTGCTGCGTGCGGACAGCG 

AGCGAACACACGACGGAGCAGCTACGACGAACGGGGGACCGAGTGGACCG 

GACGAGGATGTGGCCTAGGACGAGTGCACAAGGCTAGTGGACTCGGTCCC 

CGCGCGGTATCCCGAGTGGTCCACTGTCTGCAAACACGATTCACATAGAGC 

GGGCAGACGCGGGAGCCGTCCTAGGTGCACCGGAAGCAAATCCGTCGCCT 

GGGTGGATTTGAGTGACACGGCGCACGTGTAGCCTCACAGCTCTCCGTGGT 

CAGATGTGTAAAATTATCATAATATGTGTTTTTCAAATAGTTAAATAATATATAT 

AGGCAAGTTATATGGGTCAATAAGCAGTAAAAAGGCTTATGACATGGTAAAA 

TTACTTACACCAATATGCCTTACTGTCTGATATATTTTACATGACAACAAAGTT 

ACAAGTACGTCATTTAAAAATACAAGTTACTTATCAATTGTAGTGTATCAAGTA 

AATGACAACAAACCTACAAATTTGCTATTTTGAAGGAACACTTAAAAAAATCA 

ATAGGCAAGTTATATAGTCAATAAACTGCAAGAAGGCTTATGACATGGAAAAA 

TTACATACACCAATATGCTTTATTGTCCGGTATATTTTACAAGACAACAAAGTT 
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ATAAGTATGTCATTTAAAAATACAAGTTACTTATCAATTGTCAAGTAAATGAAA 

ACAAACCTACAAATTTGTTATTTTGAAGGAACACCTAAATTATCAAATATAGCT 

TGCTACGGAAAATGACAACATGCTTACAAGTTATTATCATCTTAAAGTTAGAC 

TCATCTTCTGAAGCATAAGAGCTTTATGGTGCAAAAACAAATATAATGACAAG 

GCAAAGATACATACATATTAAGAGTATGGACAGACATTTCTTTAACAAACTCC 

ATTTGTATTACTCCAAAAGCACCAGAAGTTTGTCATGGCTGAGTCATGAAATG 

TATAGTTCAATCTTGCAAAGTTGCCTTTCCTTTTGTACTGTGTTTTAACACTAC 

AAGCCATATATTGTCTGTACGTGCAACAAACTATATCACCATGTATCCCAAGA 

TGC I I I I I I ATTGCTATATAAACTAGCTTGGTCTGTCTTTGAACTCACATCAAT 

TAGCTTAAGTTTCCATAAGCAAGTACAAATAGCTCTAGA 

Seq #21 Rice proalmin promoter seq 

CTGCAGCATCGGCTTAGGTGTAGCAACACGACTTTATTATTATTATTATTATT 

ATTATTATTATTTTACAAAAATATAAAATAGATCAGTCCCTCACCACAAGTAGA 

GCAAGTTGGTGAGTTATTGTAAAGTTCTACAAAGCTAATTTAAAAGTTATTGC 

ATTAACTTATTTCATATTACAAACAAGAGTGTCAATGGAACAATGAAAACCAT 

ATGACATACTATAATTTTG I I I I I ATTATTGAAATTATATAATTCAAAGAGAATA 

AATCCACATAGCCGTAAAGTTCTACATGTGGTGCATTACCAAAATATATATAG 

CTTACAAAACATGACAAGCTTAGTTTGAAAAATTGCAATCCTTATCACATTGA 

CACATAAAGTGAGTG ATGAGTCATAATATTA I I I I t CTTGCTACCCATCATGT 

ATATATGATAGCCACAAAGTTACTTTGATGATGATATCAAAGAACATTTTTAG 

GTGCACCTAACAGAATATCCAAATAATATGACTCACTTAGATCATAATAGAGC 

ATCAAGTAAAACTAACACTCTAAAGCAACCGATGGGAAAGCATCTATAAATAG 

ACAAGGACAATGAAAATCCTCATCATCCTTCACCACAATTCAAATATTATAGT 

TGAAGCATAGTAGTAGAATCCAACAACAATCTAGAG 

Seq #22 Rice cysteine peptidase promoter seq 

CCAGGCTTCATCCTAACCATTACAGGCAAGATGTTGTATGAAGAAGGGCGAA 

CATGCAGATTGTTAAACTGACACGTGATGGACAAGAATGACCGATTGGTGAC 

CGGTCTGACAATGGTCATGTCGTCAGCAGACAGCCATCTCCCACGTCGCGC 

CTGCTTCCGGTGAAAGTGGAGGTAGGTATGGGCCGTCCCGTCAGAAGGTGA 

TTCGGATGGCAGCGATACAAATCTCCGTCCATTAATGAAGAGAAGTCAAGTT 

GAAAGAAAGGGAGGGAGAGATGGTGCATGTGGGATCCCCTTGGGATATAAA 

AGGAGGACCTTGCCCACTTAGAAAGGAGAGGAGAAAGCAATCCCAGAAGAA 

TCGGGGGCTGACTGGCACTTTGTAGCTTCTTCATACGCGAATCCACCAAAAC 

ACAGGAGTAGGGTATTACGCTTCTCAGCGGCCCGAACCTGTATACATCGCC 

CGTGTCTTGTGTGTTTCCGCTCTTGCGAACCTTCCACAGATTGGGAGCTTAG 

AACCTCACCCAGGGCCCCCGGCCGAACTGGCAAAGGGGGGCCTGCGCGGT 

CTCCCGGTGAGGAGCeCCACGCTCCGTCAGTTCTAAATTACCCGATGAGAA 

AGGGAGGGGGGGGGGGGAAATCTGCCTTGTTTATTTACGATCCAACGGATT 

TGGTGGACACCGATGAGGTGTCTTACCAGTTACCACGAGCTAGATTATAGTA 

CTAATTACTTGAGGATTCGGTTCCTAATTTTTTACCCGATCGACTTCGCCATG 

GAAAAI I I I I I ATTCGGGGGAGAATATCCACCCTGTTTCGCTCCTAATTAAGA 

TAGGAATTGTTACGATTAGCAACCTAATTGAGATCAGAATTGTTAGTTAGCGG 

CGTTGGATCCCTCACCTCATCCCATCCCAATTCCCAAACCCAAACTCCTCTT 

CCAGTCGCCGACCCAAACACGCATCCGCCGCCTATAAATCCCACCCGCATC 

GAGCCTATCAAGCCCAAAAAACCACAAACCAAACGAAGAAGGAAAAAAAAAG 

GAGGAAAAGAAAAGAGGAGGAAAGCGAAGAGGTTGGAGAGAGACGCTCGT 

CTCCACGTCGCCGCC 

Seq #23 Barley D-Hordian promoter 

CTTCGAGTGCCCGCCGATTTGCCAGCAATGGCTAACAGACACATATTCTGCC 

AAAACCCCAGAACAATAATCACTTCTCGTAGATGAAGAGAACAGACCAAGAT 

ACAAACGTCCACGCTTCAGCAAACAGTACCCCA GAAC TAGGATTAAGCCGAT 

TACGCGGCTTTAGCAGACCGTCCAAAAAAACTGTTTTGCAAAGCTCCAATTC 

CTCCTTGCTTATCCAATTTCTTTTGTGTTGGCAAACTGCACTTGTCCAACCGA 
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fill 1 G ITCTTCCCGTGTTTCTTCTTAGGCTAACTAACACAGCCGTGCACATAG 
CCATGGTCCGGAATCTTCACCTCGTCCCTATAAAAGCCCAGCCAATCTCCAC 
AATCTCATCATCACCGAGAACACCGAGAACCACAAAACTAGAGATCAATTCA 
TTGACAGTCCACCG 

Rice Gib promoter and Gt1 leader coding sequence 




Other storage body leader sequences 
Bx7 #24 bx7 signal peptide seq 

ATGGCTAAGCGCCTGGTCCTCTTTGCGGCAGTAGTCGTCGCCCTCGTGGCT 
CTCACCGCC 

Seq #25 Glub-2 signal peptide seq 

ATGGCAACTACCA I I I I C I CI CG II I I I C I A! A I AC Mil GTGCTATGCTATTA 
TGCCAGGGTTCTATGGCC 

Seq #26 Gt3 signal peptide seq 

ATGTGGACATTAACAAACTCTATCTTAACATCTAGTCGATCACTACTTTACTTC 
ACTATAAAAGGACCAACATATATCATCCATT 

Seq #27 Glub-1 signal peptide seq 

ATGGCGAGTTCCG I I I I C I C I CGG I I I I C I A I A I AC I I I 1 GTGTTCTTCTATTA 
TGCCATGGTTCTATGGCC 

Seq #28 proalmin signal peptide seq 

ATGAAGATCA 1 1 1 ICGTATTTGCTCTCCTTGCTATTGTTGCATGCAACGCTTC 
TGCACGGTTTGATGCT 

Seq #29 Rice cysteine peptidase signal peptide seq 

ATGGCCGCCCGCGCCGCCGCCGCCGCGTTCCTGCTGCTGCTCATCGTCGT 
TGGTCACCGCGCC 

Seq #30 D- Hodian signal peptide 

ATGGCTAAGCGGCTGGTCCTCTTTGTGGCGGTAATCGTCGCCCTCGTGGCT 
CTCACCACCGCCOther monocot maturation specific promoter sequences 


24- 
30 


02 transcription factor sequence 

ATGGAGCACGTCATCTCAATGGAGGAGATCCTCGGGCCCTTCTGGGAGCTG 

CTACCACCGCCAGCGCCAGAGCCAGAGCGAGAGCAGCCTCCGGTAACCGG 

CATCGTCGTCGGCAGTGTCATAGACGTTGCTGCTGCTGGTCATGGTGACGG 

GGACATGATGGATCAGCAGCACGCCACAGAGTGGACCTTTGAGAGGTTACT 

AGAAGAGGAGGCTCTGACGACAAGCACACCGCCGCCGGTGGTGGTGGTGC 

CGAACTCTTGTTGCTCAGGCGCCCTAAATGCTGACCGGCCGCCGGTGATGG 

AAGAGGCGGTAACTATGGCGCCTGCGGCGGTGAGTAGTGCCGTAGTAGGT 

GACCCCATGGAGTACAATGCCATACTGAGGAGGAAGCTGGAGGAGGACCTC 

GAGGCCTTCAAAATGTGGAGGGCGGCCTCCAGTGTTGTGACCTCAGATCAA 

CGTTCTCAAGGCTCAAACAATCACACTGGAGGTAGCAGCATCAGGAATAATC 

CAGTGCAGAACAAGCTGATGAACGGCGAAGATCCAATCAACAATAACCACG 

CTCAAACTGCAGGCCTTGGCGTGAGGCTTGCTACTAGCTCTTCCTCGAGAG 

ATCCTTCACCATCAGACGAAGACATGGACGGAGAAGTAGAGATTCTGGGGT 

TCAAGATGCCTACCGAGGAAAGAGTGAGGAAAAGAAAGGAATCCAATAGAG 

AATCAGCCAGACGCTCGAGATACAGGAAAGCCGCTCACCTGAAAGAACTGG 

AAGACCAGGTAGCACAGCTAAAAGCCGAGAATTCTTGCCTGCTGAGGCGCA 

TTGCCGCTCTGAACCAGAAGTACAACGACGCTAACGTCGACAACAGGGTGC 
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TGAGAGCGGACATGGAGACCCTAAGAGCTAAGGTGAAGATGGGAGAGGACT 

CTCTGAAGCGGGTGATAGAGATGAGCTCATCAGTGCCGTCGTCCATGCCCA 

TCTCGGCGCCGACCCCCAGCTCCGACGCTCCAGTGCCGCCGCCGCCTATC 

CGAGACAGCATCGTCGGCTACTTCTGCGCCACAGCCGCAGACGACGATGCT 

TCGGTCGGCAACGGTTTCTTGCGACTGCAAGGTCATCAAGAGCCTGCATCC 

ATGGTCGTCGGTGGAACTCTGAGCGCCACAGAGATGAACCGAGTAGCAGCA 

GCCACGCATTGCGCGGGGGCCATGGAGCACATCCAGACGGCGATGGGATC 

CATGCCGCCGACCTCCGCCTCCGGATCTACACCGCCGCCGCAGGATTATGA 

GCTGCTGGGTCGAAATGGGGCCATACACATGGACATGTATTAG 




PBF transcription factor sequence 

ATGGACATGATCTCCGGCAGCACTGCAGCAACATCAACACGCCACAACAAC 

CAACAGGCGGTGATGTTGTCATCCCCCATTATAAAGGAGGAAGCTAGGGAC 

CCAAAGCAGACACGAGCCATGCCCCAAATAGGTGGCAGTGGGGAGCGTAA 

GCCGAGGCCGCAACTACCTGAGGCGCTCAAGTGCCCACGCTGCGACTCCA 

ACAACACCAAG Mil GCTACTACAACAATTATAGCATGTCACAACCACGCTAC 

III! GCAAGGCTTGCCGCCGCTATTGGACACATGGTGGTACCCTCCGCAAT 

GTCCCCATTGGTGGTGGGTGTCGCAAGAACAAACATGCCTCTAGATTTGTCT 

TGGGCTCTCACACCTCATCGTCCTCATCTGCTACCTATGCACCATTATCCCC 

TAGCACCAACGCTAGCTCTAGCAATATGAGCATCAACAAACATATGATGATG 

GTGCCTAACATGACGATGCCTACCCCAACGACAATGGGCTTATTCCCTAATG 

TGCTCCCAACACTTATGCCGACAGGTGGAGGGGGGGGCTTTGACTTCACTA 

TGGACAACCAACATAGATCATTGTCCTTCACACCAATGTCTCTACCTAGCCA 

GGGGCCAGTGCCTATGCTGGCTGCAGGAGGGAGTGAGGCAACACCGTCTT 

TCCTAGAGATGCTGAGAGGAGGGATTTTTCATGGTAGTAGTAGCTATAACAC 

AAGTCTCACGATGAGTGGTGGCAACAATGGAATGGACAAGCCAI 1 1 ICGCTG 

CCATCATATGGTGCAATGTGCACAAATGGGTTGAGTGGCTCAACCACTAATG 

ATGCCAGACAACTGGTGGGGCCTCAGCAGGATAACAAGGCCATCATGAAGA 

GCAGTAATAACAACAATGGTGTATCATTGTTGAACCTCTACTGGAACAAGCA 

CAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAACAAG 

GGACAATAA 


32 


Reb transcription factor sequence 

ATGGAGCGGGTGTTCTCCGTGGAGGAGATCTCCGACCCATTCTGGGTCCCG 

CCTCCGCCGCCGCAGTCGGCGGCGGCGGCCCAGCAGCAGGGCGGCGGCG 

GCGTGGCTTCGGGAGGTGGTGGTGGTGTAGCGGGGGGCGGCGGCGGCGG 

GAACGCGATGAACCGGTGCCCGTCGGAGTGGTACTTCCAGAAG 1 1 ICTGGA 

GGAGGCGGTGCTCGATAGCCCCGTCCCGAACCCTAGCCCGAGGGCCGAAG 

CGGGAGGGATCAGGGGCGCAGGAGGGGTGGTGCCGGTCGATGTTAAGCA 

GCCGCAGCTCTCGGGGGCGGCGACGACGAGCGCGGTGGTGGACCCCGTG 

GAGTACAACGCGATGCTGAAGCAGAAGCTGGAGAAGGACCTCGCCGCGGT 

CGCCATGTGGAGGGTACAGCCATTCTCCCCCCCTCTAGTACTCGAGAGCTT 

ACTGAGATCGGCAATGCTAGCTACTGTTTGCATCGAATGTTTATAGGTATTTA 

GATGGGGCATTTCTATAGACCAATGGCGTCCATGGTCTTGCAATGCGCTCTG 

TTGAGTGTCGGTGGTTGGTTCGACTCATAGTATGTAGGGTTGTGCGTATGTA 

CAAACGGAAGCTTCATAGACCTCGGTATTGAGATTGCGATATCGATGCAACC 

TGCGAATTGGCGATGTAATCAGTCATATTCTTACTAAACTGCGAGACAGTGG 

TTTGTTTGCAATTGCAATA 1 1 1 1 1 GTATGGGGCTGCTTAAACTGTCATTGGCT 

1 1 1 lAGATTGGCAATATGTGACTTTATGCAAGTATTTGATTGGGCGGATCCAG 

GAACAAAAAGTTGGGGGGATTCAACATACCGAGTACACTGGCATAAACACAT 

CATCTCAGTATTAAACTATGCTAAAATGCTATTAAGAGACCTTTAGCACCTCT 

TATCTTATCAACCATGGTGAAAAAATTGAAGGGGGGACTCAGGGGGGTATCC 

ATGGGTCCGATGGGTGCAGGGGGGACTGAGTCCCCCCTGCACCCACGTTG 

AATCGGCCCTGGCATGCGTATAAGCTGTCACAGCCA 1 1 1 CTAGGTGCTT GTG 

CTTAGTTGGGTGATGTCAGCTTAATTTGTCTTTI CT ATGTCGTCA1UGA 1 1 1 1 

CTAAGAAACGAAAAATAGCCTATTTATGTGCTCCAGAATTTGATGATCCCTGG 

CCCTTCATTTGCTGAAATTAGCCTATTTGTTGGTTGCCCTTCAG 1 1 1 1 1 I CCC 

AGCTTATGTTGTTGCAATGTGTGGCTATGGCTCGI 1 1 1 GTGCCCTATAATTTA 

TTATTTGCAATTCATTTTTGTACATGACTTAAAATGACACTAGAGCAACATGCA 

CTGATTGGTTATCCTATAATCATTTATGTAGTTCTGTTCAI 1 1 1 ATCATGCTAG 
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| CTCATGTCATTTTCATCTTCAGGCCTCTGGCACAGTTCCACCTGAGCGTeCT 
GGAGCTGGTTCATCCTTGCTGAATGCAGATGTTTCACACATAGGCGCTCCTA 
ATTCCATCGGAGGTACTTATCTTATCTGGTTACATTTTCAGATTGTTATGAAA 
CTACCCAAATATCCTGCACAATTGCATGGGATTAAATTTTAGTTTCTrTGAAAT 
AGAAGTAGAGTTGTATTGCTGTCACGTCATCAAATAGTTCTGAAGCTATGAAT 
AAATAAGTTCCGCATTTGTTAGTGATTCTTTGAACATTAGAATTGTTATGCTTA 
AGTAGATAGGGTTATGTTTGTTTGGAGTTCCCTTAAATCATTTCATTGCTGAC 
TGCCAGCTGGCAGGAGCATTTGTTGTTGCCTTGACCATGAATGAAGACCTTC 
CTGTTCTGAGTGCTCACAAGAAAACATATTTTGATTAATGCACCTTGAATCCT 
TAGGATCTTGCAAAGATGGGCACTTAGCTTTAGAATTGAGTAGTACTTAAATA 
GCTGTTGTTATCATGATTTGTCCTGTAGTGAAATGTCGACAAAACAGGAATG 
CTACTTTTGAC1TCTGATATTTCATGCCTGGCTTTACTTATGCTCTGTTTGGAA 
CATGGGCACATATCAGGCAATGCTACTCCAGTTCAAAACATGCTAAGTGGCC 
CAAGTGGGGGATCGGGCTCACAGTTGGTACAGAATGTTGATGTCCTTGTAAA 
GCAGCCCACCAGCTCTTCATCAAGGGAGCAGTCAGATGATGATGACATGAA 
GGGAGAAGCTGAGACCACTGGAACTGCAAGACCTGCTGATCAAAGATTACA 
ACGAAGGTGATCATTCATTGCTTCCTTGTAATATAGATTCTGTACATAATTAA 
CCTACCTCGTCATGCATGCATGTGTCCTATTTTCACCTTAGCCCTTTCAGTTG 
GATTTCCACTTTCATCCGGTAGCCTTTCAGTTTCCTATTGCATCGCATATATG 
ATCTTTTACCTACCATATTAGTTCTCTGTGTGCCATACTCAGTGCTTAGTGTC 
TCGAGCAAGAGAGGAATTTGTATGGCTATTACACGTAGCACTTTGCTCTCTA 
CTTGTTTATTGACATAAGCAATTTGGGATGAATTAAATCTGAGTTCACATCAT 
ATTCCTTATGTCACAAGTTTCTGAAACCGATTGTATCTAGTATCTGGTTGATG 
CACCCCCATCTTGGATTTGCAAATCAAAGTTATACTCCCTAGAGAGCTTTACC 
TTTCATAAAGCAATTACCCCAATAAACCACGGATTTGATAGCTATTGACTATG 
AWACCAGAATTCATTTGGCAGCTATTTTCTCAATTTAAGTTTGGTATTAGTCT 
CAGTTGGCTGTAAAATAATGTCACGGTAGGGTACATGTATGTGCAGCATACA 
AGGTATGGGTGAGTTATGATATGGACAGTGTGTACACCCCACATTTGCTCAC 
TAAAATCAAAATATTCAAACGTCACGTGATGATATGGTGGATTGCATTATACC 
TTGTATTGTTTATTATGTTACTTGTGCTAGACAATAATATAGGCTGTTCTTTTG 
GGTGATTTTGTATGAAGATGTTGAGCAAGCACTTCTCGATATAATGCTAGTTT 
TGTTGACCTGTTCCAGGAAGCAATCCAATCGGGAGTCAGCCAGGCGCTCAA 
GAAGCAGAAAGGCAGCTCACTTGAATGAGCTGGAGGCACAGGTGTGATAGT 
TCACATAGTTATTTTCGATAAGACATAAAATCCTAAATTACTGGCTACTGACTT 
CAGTTATGGATTTACTTGTTACAGGTATCGCAATTAAGAGTCGAGAACTCCTC 
GCTGTTAAGGCGTCTTGCTGATGTTAACCAGAAGTACAATGATGCTGCTGTT 
GACAATAGAGTGCTAAAAGCAGATGTTGAGACCTTGAGAGCAAAGGTATGCT 
ATATATGCCTTTTGCAATATGCATCCCATGGATTGCTACTTTGGCTTGTTTCA 
AACTTTCAACGTGACTTGTGTACCCTGTTATTAGAAGAATAATCCCGCCTACC 
ATTATACTCTATAAATCACCATTTGGCCAGTCCAAACATGATTATTAAATCAG 
GTCAATCTGAACATTGAAATGTATCAAAAATTCGCAGGTGAAGATGGCAGAG 
GACTCGGTGAAGCGGGTGACAGGCATGAACGCGTTGTTTCCCGCCGCTTCT 
GATATGTCATCCCTCAGCATGCCATTCAACAGCTCCCCATCTGAAGCAACGT 
CAGACGCTGCTGTTCCCATCCAAGATGACCCGAACAATTACTTCGCTACTAA 
CAACGACATCGGAGGTAACAACAACTACATGCCCGACATACCTTCTTCGGCT 
CAGGAGGACGAGGACTTCGTCAATGGCGCTCTGGCTGCCGGCAAGATTGG 
CCGGCCAGCCTCGCTGCAGCGGGTGGCGAGCCTGGAGCATCTCCAGAAGA 
GGATGTGCGGTGGGCCGGCTTCGTCTGGGTCGACGTCCTGA 
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WHAT IS CLAIMED IS : 

1 . A method of producing a heterologous polypeptide in a grain of a plant, 
comprising the steps of: 

(a) culturing a transformed plant to form a grain-producing transformed plant; 

and 

(b) recovering transgenic grains containing the heterologous polypeptide 
from the grain-producing transformed plant; 

wherein the transformed plant comprises a first chimeric gene; 
wherein the first chimeric gene comprises (i) a rice glutelin Gt1 promoter; (ii) a 
nucleic acid leader encoding a rice glutelin Gt1 signal peptide; and (iii) a heterologous 
nucleic acid that encodes the heterologous polypeptide, all operably linked to enable 
expression of the heterologous polypeptide. 

2. A method of producing a heterologous polypeptide in a grain of a plant, 
comprising the steps of: 

(a) culturing a transformed plant to form a grain-producing transformed plant; 

and 

(b) recovering transgenic grains containing the heterologous polypeptide 
from the grain-producing transformed plant; 

wherein the transformed plant comprises a first chimeric gene; 
wherein the first chimeric gene comprises (i) a rice globulin Gib promoter; (ii) a 
nucleic acid leader encoding a rice glutelin Gt1 signal peptide; and (iii) a heterologous 
nucleic acid that encodes the heterologous polypeptide, all operably linked to enable 
expression of the heterologous polypeptide. 

3. The method of claim 1 or 2, wherein the method further comprises the 
steps of transforming a plurality of plant cells; culturing a plurality of transformed plant 
cells to form a plurality of grain-producing transformed plants; and selecting for further 

30 culturing a transformed plant that exhibits high level expression of the heterologous 
polypeptide in its grains. 

4. The method of claim 1 or 2, wherein the transformed plant that is cultured 
to form a grain-producing transformed plant is generated from a grain of a parent plant 

35 that has been transformed with the first chimeric gene. 
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5. The method of claim 1 or 2, wherein the transformed plant further 
comprises a transcription factor-encoding nucleic acid that encodes at least one 
heterologous transcription factor that is capable of enhancing expression of the 

5 heterologous polypeptide. 

6. The method of claim 5, wherein the heterologous transcription factor is 
selected from the group consisting of 02 and PBF and an active fragment thereof. 

10 7. The method of claim 2, wherein the transformed plant further comprises a 

transcription factor-encoding nucleic acid that encodes at least one heterologous 
transcription factor selected from the group consisting of Reb, 02 and PBF, and an 
active fragment thereof. 

15 8. The method of claim 5, wherein the transformed plant further comprises 

one or more transcription factor-encoding nucleic acids that encode two heterologous 
transcription factors that are capable of enhancing expression of the heterologous 
polypeptide. 

20 9. The method of claim 8, wherein the two heterologous transcription factors 

are 02 and PBF. 

10. The method of claim 1 or 2, wherein the heterologous nucleic acid that 
encodes the heterologous polypeptide is codon optimized for expression in a dicot or a 

25 monocot. 

1 1 . The method of claim 1 or 2 t wherein the plant is a dicot. 

12. The method of claim 1 or 2, wherein the plant is a monocot. 

30 

13. The method of claim 12, wherein the monocot is rice, wheat or barley. 

14. The method of claim 1 or 2, wherein the heterologous polypeptide is a 
non-plant storage polypeptide. 

35 
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15. The method of claim lor 2, wherein the heterologous polypeptide is 
selected from the group consisting of: antibodies, cytokines, lymphokines, chemokines, 
hormones, growth factors, coagulation factors, anti-infectives, and cytotoxins. 

5 16. The method of claim 1 or 2, wherein the heterologous polypeptide is a 

human or non-human animal polypeptide. 

17. The method of claim 1 or 2, wherein the heterologous polypeptide is a 
milk polypeptide. 

10 

18. The method of claim 1 or 2, wherein the heterologous polypeptide is 
other than a milk polypeptide. 

19. The method of claim 1 or 2, wherein the heterologous polypeptide is an 
15 anti-inflammatory molecule. 

20. The method of claim 1 or 2, wherein the heterologous polypeptide is 
intestinal trefoil factor (ITF) or an active fragment thereof. 

20 21 . The method of claim 1 or 2, wherein the heterologous polypeptide is IgA 

or an active fragment thereof. 

22. The method of claim 1 or 2, wherein the heterologous polypeptide is 
selected from the group consisting of: iactoferrin, lysozyme, lactoferricin, intestinal trefoil 
25 factor (ITF), epidermal growth factor (EGF), keratinocyte growth factor (KGF), insulin- 
like growth factor I (IGF-I), lactohedrin, kappa-casein, haptocorrin, lactoperoxidase, 
alpha-1 -antitrypsin, immunoglobulins, alpha-lactalbumin, beta-Iactogiobulin, alpha- 
casein, beta-casein, albumin, fibrinogen, and a protease inhibitor. 

30 23. The method of claim 1 , wherein the first chimeric gene is selected from 

the group consisting of: 

(a) a chimeric gene comprising a rice glutelin Gt1 promoter, a rice glutelin 
Gt1 signal peptide, and a codon optimized lysozyme coding sequence; 

(b) a chimeric gene comprising a rice glutelin Gt1 promoter, a rice glutelin 
35 Gt1 signal peptide, and a codon optimized Iactoferrin coding sequence; 
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(c) a chimeric gene comprising a rice giutelin Gt1 promoter, a rice glutelin 
signal peptide, and a codon optimized EGF; 

(d) a chimeric gene comprising a rice glutelin Gt1 promoter, a rice glutelin 
signal peptide, and a codon optimized IGF-I; 

(e) a chimeric gene comprising a rice glutelin Gt1 promoter, a rice glutelin 
signal peptide, and a codon optimized haptocorrin; 

(f) a chimeric gene comprising a rice giutelin Gt1 promoter, a rice glutelin 
signal peptide, and codon optimized IGF-I; 

(g) a chimeric gene comprising a rice giutelin Gt1 promoter, a rice glutelin 
signal peptide, and a codon optimized lactahedrin; 

(h) a chimeric gene comprising a rice glutelin Gt1 promoter, a rice glutelin 
signal peptide, and a codon optimized iactoperoxidase; and 

(i) a chimeric gene comprising a rice glutelin Gt1 promoter, a rice glutelin 
signal peptide, and a codon optimized ot r antitrypsin. 

24. The method of claim 2, wherein the first chimeric gene is selected from 
the group consisting of: (a) a rice globulin Gib promoter, a rice glutelin Gt1 signal 
peptide, and codon optimized haptocorrin; and (b) a rice globulin Gib promoter, a rice 
glutelin Gt1 signal peptide, and codon optimized lysozyme. 

20 

25. The method of claim 1 , 2, 5 or 8, wherein the transformed plant 
comprises a second chimeric gene, wherein the second chimeric gene comprises: (i) an 
aleurone-specific promoter; and (ii) the heterologous nucleic acid encoding the 
heterologous polypeptide, operably linked together to enable expression of the 

25 heterologous polypeptide. 

26. An expression vector comprising a first chimeric gene that comprises: (i) 
a rice glutelin Gt1 promoter; (ii) a nucleic acid leader encoding a rice glutelin Gt1 signal 
peptide; and (iii) a heterologous nucleic acid that encodes a heterologous polypeptide, 

30 all operably linked to enable expression of the heterologous polypeptide. 

27. An expression vector comprising a first chimeric gene that comprises: (i) 
a rice globulin Gib promoter; (ii) a nucleic acid leader encoding a rice glutelin Gtt signal 
peptide; and (iii) a heterologous nucleic acid that encodes a heterologous polypeptide, 

35 all operably linked to enable expression of the heterologous polypeptide. 
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28. The expression vector of claim 26 or 27, further comprising a selectable 
marker. 

29. The expression vector of claim 28, wherein the selectable marker is an 
5 antibiotic resistance gene. 

30. The expression vector of claim 28, wherein the heterologous polypeptide 
is a non-plant storage polypeptide. 

10 31 . The expression vector of claim 28, wherein the heterologous polypeptide 

is selected from the group consisting of: antibodies, cytokines, lymphokines, 
chemokines, hormones, growth factors, coagulation factors, anti-infectives, and 
cytotoxins. 

1 5 32. The expression vector of claim 28, wherein the heterologous polypeptide 

is a human or non-human animal polypeptide. 

33. The expression vector of claim 28, wherein the heterologous polypeptide 
is a milk polypeptide. 

20 

34. The expression vector of claim 28, wherein the heterologous polypeptide 
is other than a milk polypeptide. 

35. The expression vector of claim 28, wherein the heterologous polypeptide 
25 is selected from the group consisting of: lactoferrin, lysozyme, lactoferricin, intestinal 

trefoil factor (ITF), epidermal growth factor (EGF), keratinocyte growth factor (KGF), 
insulin-like growth factor I (IGF-I), lactohedrin, kappa-casein, haptocorrin, 
lactoperoxidase, alpha-1 -antitrypsin, immunoglobulins, immunoglobulins, alpha- 
lactalbumen, beta-lactoglobulin, alpha-casein, beta-casein, albumin, and fibrinogen. 

30 

36. The expression vector of claim 28, further comprising a second chimeric 
gene that comprises: (i) an aleurone-specific promoter; and (ii) the heterologous nucleic 
acid that encodes the heterologous polypeptide, operably linked to enable expression of 
the heterologous polypeptide. 

35 
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37. The expression vector of claim 28 or 36, further comprising at least one 
transcription factor-encoding nucleic acid that encodes at least one heterologous 
transcription factor that is capable of enhancing expression of the heterologous 
polypeptide. 

5 

38. A transformed host comprising the expression vector of any of claims 26, 
27, or 36. 

39. The transformed host of claim 38, wherein the host is a plant cell, a plant 
10 or a grain. 

40. The transformed host of claim 38, further comprising at least a first 
transcription factor-encoding nucleic acid that encodes at least a first heterologous 
transcription factor that is capable of enhancing expression of the heterologous 

1 5 polypeptide in the transformed host. 

41 . The transformed host of claim 40, further comprising a second 
transcription factor-encoding nucleic acid that encodes a second heterologous 
transcription factor that is capable of enhancing expression of the heterologous 

20 polypeptide. 

42. The transformed host of any of claims 38, 40 or 41 , wherein the 
transformed host is a dicot. 

25 43. The transformed host of any of claims 38, 40 or 41 , wherein the 

transformed host is a monocot. 

44. A method of making a transformed host cell comprising transfecting a 
host cell with the expression vector of any of claims 26, 27, 36 or 37. 

30 

45. The method of claim 44, further comprising the steps of selecting a 
transformed cell, and culturing the transformed cell to produce a transformed plant. 

46. The method of claim 45, further comprising the step of allowing the 
35 transformed plant to produce transformed grains. 
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47. Transformed grains comprising a first chimeric gene that comprises (i) a 
rice glutelin Gt1 promoter; (ii) a nucleic acid leader encoding a rice glutelin Gt1 signal 
peptide; and (Hi) a heterologous nucleic acid that encodes a heterologous polypeptide, 
afl operably linked to enable expression of the heterologous polypeptide. 

48. Transformed grains comprising a first chimeric gene that comprises (i) a 
rice globulin Gib promoter; (ii) a nucleic acid leader encoding a rice glutelin Gt1 signal 
peptide; and (iii) a heterologous nucleic acid that encodes a heterologous polypeptide, 
all operably linked to enable expression of the heterologous polypeptide. 

49. The transformed grains of claim 47 or 48, further comprising a second 
chimeric gene that comprises (i) an aleurone-specific promoter; and (ii) the heterologous 
nucleic acid that encodes the heterologous polypeptide, operably linked to enable 
expression of the heterologous polypeptide. 

, 50. The transformed grains of any of claims 47, 48 or 49, further comprising 
at least one transcription factor-encoding nucleic acid that encodes at least one 
heterologous transcription factor that is capable of enhancing expression of the 
heterologous polypeptide. 

51 . The transformed grains of any of claims 47, 48, 49 or 50, wherein the 
transformed grains have been wet or dry milled. 

52. The transformed grains of any of claims 47, 48, 49 or 50, wherein the 
25 transformed grains have been subjected to an extraction process. 

53. A heterologous polypeptide produced by the process of claim 1 or 2. 

54. A codon optimized nucleic acid molecule for expression of polypeptides 
30 in monocot, wherein the nucleic acid is selected from the group consisting of: 

(i) a h LF nucleic acid encoding human lactoferrin; 

(ii) a hLZ nucleic acid encoding human lysozyme; 

(iii) a hEGF nucleic acid encoding human epidermal growth factor; 

(iv) hIGF-i nucleic acid encoding human insulin-like growth factor I; 

35 
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55. The codon optimized nucleic acid molecule of claim 52, wherein nucleic 
acid molecule comprises a sequence selected from the group consisting of: (i) 

hLF nucleic acid sequence as shown in SEQ ID NO:3; 
(ii) hLZ nucleic acid sequence as shown in SEQ ID NO:1 ; 
5 (iii) hEGF nucleic acid sequence as shown in SEQ ID NO: 8. 

(iv) hIGF-l nucleic acid sequence as shown in SEQ ID NO: 9 

56. A method of producing a heterologous polypeptide in a grain of a plant, 
comprising the steps of: 

10 (a) culturing a transformed plant to form a grain-producing transformed plant; 

and 

(b) recovering grains containing the heterologous polypeptide from the grain- 
producing transformed plant; 

wherein the transformed plant comprises a first chimeric gene and at least one 
1 5 heterologous transcription factor that is capable of enhancing the expression of the first 
chimeric gene; 

wherein the first chimeric gene comprises: (i) a promoter of a storage protein 
gene; (ii) a nucleic acid leader encoding a signal peptide of a storage protein; and (iii) a 
heterologous nucleic acid that encodes the heterologous polypeptide, all operably linked 
20 to enable expression of the heterologous polypeptide. 

57. The method of claim 56, wherein the transformed plant further comprises 
a second chimeric gene that comprises: (i) an aleurone specific promoter; and (ii) the 
heterologous nucleic acid, all operably linked to enable expression of the heterologous 

25 polypeptide. 

58. The method of claim 56, wherein the storage protein gene is selected 
from the group consisting of: rice gluteKns, rice oryzins, rice prolamines, barley'hordeins, 
wheat gliadins, wheat glutenins, maize zeins, maize glutelins, oat glutelins, sorghum 

30 kafirins, millet pennisetins, and rye secalins. 

59. The method of claim 56, wherein the transformed plant comprises two 
heterologous transcription factors that are capable of enhancing expression of the 
heterologous polypeptide. 

35 
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60. The method of claim 59, wherein the heterologous transcription factor is 
selected from the group consisting of Reb, 02, and PBF. 

61. The method of claim 56, wherein the signal peptide is a rice Gt1 signal 
5 peptide or rice Gib signal peptide. 

62. The method of claim 56, wherein the promoter is a rice Gt1 promoter, a 
rice Gib promoter or a wheat glutenin promoter. 

1 0 63. The method of claim 57, wherein the aleurone specific promoter a 

promoter of the lipid transfer protein gene Ltp1 or a promoter of chitinase gene Chi26. 

64. The method of claim 56, wherein the transformed plant is a dicot. 

1 5 65. The method of claim 56, wherein the transformed plant is a monocot. 

66. The method of claim 65, wherein the monocot is a cereal. 

67. The method of claim 66, wherein the cereal is rice, wheat, or barley. 

20 

68. The method of claim 56, wherein the heterologous polypeptide is 
selected from the group consisting of: antibodies, cytokines, lymphokines, chemokines, 
hormones, growth factors, coagulation factors, anti-infectives, and cytotoxins. 

25 69. The method of claim 56, wherein the heterologous polypeptide is 

selected from the group consisting of: lactoferrin, lysozyrne, lactoferricin, intestinal trefoil 
factor (ITF), epidermal growth factor (EGF), keratinocyte growth factor (KGF), insulin- 
like growth factor I (IGF-I), lactohedrin, kappa-casein, haptocorrin, lactoperoxidase, 
alpha-1 -antitrypsin, immunoglobulins, alpha-lactalbumin, beta-lactoglobulin, alpha- 

30 casein, beta-casein, albumin, and fibrinogen. 

70. An expression vector comprising a first chimeric gene that comprises (i) 
an aleurone-specific promoter and (ii) a heterologous nucleic acid encoding a 
heterologous polypeptide, operably linked to enable expression of the heterologous 
35 polypeptide. 
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71 . The expression vector of claim 70, further comprising a transcription 
factor-encoding nucleic acid that encodes at least one heterologous transcription factor 
that is capable of enhancing expression of the heterologous polypeptide. 

5 72. The expression vector of claim 70 or 71 , further comprising a second 

chimeric gene that comprises: (i) a promoter of a storage protein gene; (ii) a nucleic acid 
leader encoding a signal peptide of a storage protein; and (iii) the heterologous nucleic 
acid encoding the heterologous polypeptide, all operably linked to enable expression of 
the heterologous polypeptide, 

10 

73. A transformed host comprising a first chimeric gene comprising an 
aleurone-specific promoter operably linked to a first heterologous polypeptide to enable 
expression of the heterologous polypeptide. 

15 74. The transformed host of claim 73, further comprising at least one 

heterologous transcription factor. 

75. The transformed host of claim 73 or 74, further comprising a second 
chimeric gene comprising (i) a promoter of a storage protein gene, (ii) a nucleic acid 

20 leader encoding a signal peptide of a storage protein, and (iii) a heterologous 

polypeptide, all operably linked to enable expression of the heterologous polypeptide. 

76. The transformed host of claim 75, further comprising two heterologous 
transcription factors. 

25 

77. A composition comprising a heterologous polypeptide in transformed 
grain, wherein the transformed grain comprises the expression vector of claim 70, 71 or 
72. 

30 78. The composition of claim 77, wherein the transformed grain has been wet 

or dry milled. 

79. The composition of claim 77, wherein the transformed grain has been 
subjected to an extraction process. 

35 
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80. A method for processing a transgenic seed that comprises a 
heterologous polypeptide, comprising the steps of: 

(a) powderizing the transgenic seed in a buffer to form bufferized 
powdered seed; 

5 (b) removing cell debris from the bufferized powdered seed to obtain 

a supernatant; and 

(c) reducing liquid from the supernatant to produce an extract that 
comprises the heterologous polypeptide. 

10 81 . The method of claim 80, wherein the buffer is selected from the group 

consisting of: saline, phosphate buffered saline (PBS), ammonium bicarbonate buffer, 
ammonium acetate buffer, and Tris, with or without iron supplement. 

82. The method of claim 80, wherein the step of removing cell debris is 
1 5 performed by centrifugation, filtration or sedimentation. 

83. The method of 80, wherein the step of reducing liquid from the 
supernatant is performed by lyophilization or spray drying. 

20 84. The method of claim 80, wherein the heterologous polypeptide is active 

after processing. 

85. The method of claim 80, wherein the seed is a cereal grain. 

25 86. The method of claim 80, wherein the seed is a rice, wheat or barley seed. 

87. An extract comprising a plant-produced heterologous polypeptide 
produced by the process of claim 80. 

30 88. The extract of claim 87, wherein the heterologous polypeptide is selected 

from the group consisting of: antibodies, cytokines, iymphokines, chemokines, 
hormones, growth factors, coagulation factors, anti-infectives, cytotoxins and active 
fragments thereof. 
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89. The extract of claim 80, wherein the heterologous polypeptide is a milk 
polypeptide, 

90. A method of processing transgenic grains containing a heterologous 
5 polypeptide to produce a malt syrup comprising the steps of: 

(a) providing transgenic grains that contain starch and heterologous 

polypeptides; 

(b) providing an enzymatic composition for conversion of malt to wort; 

(c) combining the transgenic grains with the enzymatic composition 
10 under conditions that allow for conversion of starch in the transgenic grains to sugars; 

(d) allowing the starch in the transgenic grains to be at least partially 

converted; and 

(e) separating resulting transgenic malt syrup from resulting 
transgenic grain residue, each containing the heterologous polypeptide. 

15 

91 . The method of claim 90, wherein the enzymatic composition comprises 
amylases. 

92. The method of claim 90, wherein the enzymatic composition comprises a 
20 liquid malt extract comprising barley. 

93. A method of processing transgenic grains containing a heterologous 
polypeptide to produce a malt extract comprising the steps of: 

(a) providing transgenic grains that contain starch and heterologous 

25 polypeptides; 

(b) allowing the transgenic grains to undergo a malting process to 
form malted transgenic grains; and 

(c) crushing the malted transgenic grains. 

30 94. The method of claim 93, further comprising the step of mixing the 

crushed and malted transgenic grains with water to obtain a liquid transgenic malt 
extract. 

95. The method of claim 90 or 93, wherein the transgenic grains are cereal 

35 grains. 
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96. The method of claim 95, wherein the cereal grains are selected from the 
group consisting of com, sorghum, rice, barley, rye, wheat, oats and triticale. 

97. The method of claim 90 or 93, wherein the transgenic grains are grains of 

5 a dicot. 

98. A transgenic malt syrup comprising a heterologous polypeptide produced 
in transgenic grains in a malt syrup. 

10 99. A transgenic malt extract comprising a heterologous polypeptide 

produced in transgenic grains in a malt extract. 

100. An edible food comprising a heterologous polypeptide in a transgenic 
malt syrup, a transgenic malt extract, or a transgenic grain residue in a food product. 

15 

1 01 . A protein-containing product comprising: 

a monocot seed fluor, extract, or malt composition containing monocot seed or 
malted-seed components, and one or more seed-produced, non-seed proteins in 
substantially unpurified form, and 
20 a vehicle containing the extract in a form suitable for human or animal use. 

1 02. The product of claim 101 , wherein said vehicle is selected from the group 
consisting of a capsule, binder components effective to tabletize the composition, a 
consumable liquid, and a consumable suspension, 

25 

103. The product of claim 101 , wherein the vehicle is a processed food in 
which the extract is mixed. 

1 04. The product of claim 101 or 1 02, wherein the seed-produced, non-seed 
30 proteins are lysozyme and lactoferrin. 

1 05. The product of claim 101 , wherein the protein is a blood-clotting factor 
and the vehicle is a surgical dressing, powder or cream. 
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106. The product of claim 101 , wherein the protein is an industrial enzyme, the 
extract is in a iiquid or powder form, and the vehicle is a liquid or powder dispenser, 
respectively. 

5 1 07. The product of claim 101, wherein the protein is a detergent enzyme and 

the vehicle is a detergent. 

108. The product of claim 101 , wherein the protein is an anti-infective, and the 
vehicle is selected from the group consisting of: toothpaste, mouthwash, hand soap, 

10 dish soap, dishwasher detergent, tile cleaner, bathroom cleaner, toilet cleaner. 

1 09. The product of claim 101 , wherein the weight ratio of protein: seed 
components is less than 5%. 

15 110. The product of claim 1 01 , wherein the weight ratio of protein: seed 

components is less than 0.5%. 

111. A method for producing a selected protein other than a monocot-plant 
seed-storage protein, comprising the steps of: 

20 (a) obtaining a monocot plant that has been stably transformed with a first 

chimeric gene having (i) a rice glutelin Gt1 promoter or a rice globulin Gib promoter, (ii) 
operably linked to the promoter, a leader DNA encoding a glutelin Gt1 leader sequence, 
and (Hi) a protein-coding sequence encoding the selected protein, and linked in 
translation reading frame with the leader sequence, such that the leader and the 

25 protein-encoding sequences encode a fusion protein composed of the selected protein 
and an N-terminal glutelin Gt1 leader sequence, 

(b) cultivating the plant under seed-maturation conditions, 

(c) harvesting mature seeds from the cultivated plants, and 

(d) extracting from the harvested seeds, an extract containing the selected 

30 protein. 

112. The method of claim 111, wherein the mature seeds contain at least 
about 0.25% weight percent of the selected protein. 
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1 1 3. The method of claim 111, wherein the mature seeds contain at least 
about 0.5% weight percent of the mature seed weight. 

1 14. The method of claim 111, wherein the rice glutelin Gt1 promoter and the 
5 leader DNA encoding a glutelin Gt1 leader have the sequence identified by SEQ ID NO: 

15. 

115. The method of claim 111, wherein the rice globulin Gib promoter and the 
leader DNA encoding a glutelin Gt1 leader have the sequence identified by SEQ ID 

10 NO. 16 

116. The method of claim 111, wherein the step of extracting comprises 
crushing the seeds and suspending the crushed seeds in an aqueous medium buffered 
to physiological pH. 

15 

117. The method of claim 1 1 1 , wherein the step of extracting comprises 
milling mature seeds to produce a seed grain and malting the resulting seed grain. 

118. A plant stably transformed with a chimeric gene comprising (i) a rice 
20 glutelin Gt1 promoter or a rice globulin Gib promoter, (ii) operably linked to said 

promoter, a leader DNA encoding a glutelin Gt1 leader sequence, and (iii) a protein- 
coding sequence encoding the selected protein, and linked in translation reading frame 
with the leader sequence, such that the leader and protein-encoding sequences encode 
a fusion protein composed of the selected protein and an N-terminal glutelin Gt1 leader 
25 sequence. 

119. The plant of claim 118, wherein the plant is further transformed with one 
or more nucleic acid that encode at least one transcription factor that are capable of 
enhancing expression of the selected protein. 
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Sequence Range Egf actor: 4 to 165 

10 20 30 40 50 

Egf actor AAC TCC GAC TCG GAG TGC CCC CTC TCC CAC GAC GGT TAC TGC CTC CAC GAC GGG 
N S D. S E C P L S H D G Y C L H D G> 

3360 3370 3380 3390 3400 

Native Gene AAT GGT GAC TCT GAA TGT CCC CTG TCC CAC GAT GGG TAC TGC CTC CAT GAT GGT 

II 1 1 1 il II II Mill Ml 111 II I I III III III II II II 

Egfactor AAC TCC GAC TCG GAG TGC CCC CTC TCC CAC GAC GGT TAC TGC CTC CAC GAC GGG 

60 70 80 90 100 110 

Egfactor GTC TGC ATG TAC ATC GAG GCC CTC GAC AAG TAC GCC TGC AAC TGC GTC GTG GGC 
VCMY I E A LDKYACNCV V G> 

3410 3420 3430 3440 3450 

Native Gene GTG TGC ATG TAT ATT GAA GCA TTG GAC AAG TAT GCA TGC AAC TGT GTT GTT GGC 

II IN III I I II II I I I III III II II III III II II | | in 
Egfactor GTC TGC ATG TAC ATC GAG GCC CTC GAC AAG TAC GCC TGC AAC TGC GTC GTG GGC 

120 130 140 150 160 

Egfactor TAC ATC GGC GAG CGG TGC GAG TAC CGC GAC CTC AAG TGG TGG GAG CTG CGC TGA 
Y I G ERCQ Y RDLKWWELR*> 

3460 3470 3480 3490 3500 

Native Gene TAC ATC GGG GAG CGA TGT CAG TAC CGA GAC CTG AAG TGG TGG GAA CTG CGC 

I II 1 1 I I I III II II 1 1 1 I 1 1 1 1 1 1 1 I I 1 1 1 I I I II I 1 1 IN Ml 

Egfactor TAC ATC GGC GAG CGG TGC CAG TAC CGC GAC CTC AAG TGG TGG GAG CTG CGC TGA 



Epidermal Growth Factor 

Number of codons in mature peptide: 53 

Number of codons changed: 27(51%) 

Number of nucleotides changed: 30 (1 9%) 
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Sequence Range: 4 to 216 
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10 20 30 40 50 60 70 

GGTACCCATC TAATACATTA ATAACAAGAG AGAGAATGGA TAATGCAATT ATTTATTTTT ATGGGAGGCT 
CCATGGGTAG ATTATGTAAT TATTGTTCTC TCTCTTACCT ATTACGTTAA TAAATAAAAA TACCCTCCGA 

80 90 100 110 120 130 140 

ATATTTTTAT CGGATTTTAG TAAATAACGG GGCAATTCGG TACTTAGGTA AAGCTACGTA TGACTATCGC 
TATAAAAATA GCCTAAAATC ATTTATTGCC CCGTTAAGCC ATGAATCCAT TTCGATGCAT ACTGATAGCG 

150 160 170 180 190 200 210 

TACCGCTACG GTAGTTGAAT TGGAATTCTT CGATAGCATC TGTTGTGTTG TTGCAGTTAG GGTACTTGAA 
ATGGCGATGC CATCAACTTA ACCTTAAGAA GCTATCGTAG ACAACACAAC AACGTCAATC CCATGAACTT 

220 230 240 250 260 270 280 

TAGCTCCAGC CGTGAAAACG AGGGGTTTTC GCAGGTTTTA TAGGATTGCC AAGTTAGACT AGGGCAATTC 
ATCGAGGTCG GCACTTTTGC TCCCCAAAAG CGTCCAAAAT ATCCTAACGG TTCAATCTGA TCCCGTTAAG 

290 300 310 320 330 340 350 

ATGTTCACGG TATTGTGTAG TATATGAAAA AGGAGATCTC CCAAACAATT TATAATTTTG TATAAGGGAG 
TACAAGTGCC ATAACACATC ATATACTTTT TCCTCTAGAG GGTTTGTTAA ATATTAAAAC ATATTCCCTC 

>AT-rich_region_ 
I 

360 370 380 390 400 410 420 

AAATCGAACT TGAGGTGTCT AATTCACCAA CCGAGCTACT CCCTCCGTTT CATATATGTA TATACATATA 
TTTAGCTTGA ACTCCACAGA TTAAGTGGTT GGCTCGATGA GGGAGGCAAA GTATATACAT ATATGTATAT 

430 440 .450 460 470 480 490 

TACGTATATA TACGTATATA CACATATACG TATATACATA TATGGTATAT ACATATATAT ATATATATAT 
ATGCATATAT ATGCATATAT GTGTATATGC ATATATGTAT ATACCATATA TGTATATATA TATATATATA 

500 510 520 530 540 550 560 

ATATATATAT ATGTGTGTGT GTGTATGTGG GGTGGCAATG CTAAAAAGTT TTATAATATG AACGGATGAA 
TATATATATA TACACACACA CACATACACC CCACCGTTAC GATTTTTCAA AATATTATAC TTGCCTACTT 

570 580 590 600 610 620 630 

GTACTATCCA CTAAGTCCCT ATAGTTTTCT GGCACTGTGT AGTATACGAA TGCACAATTA TATCCATAAA 
CATGATAGGT GATTCAGGGA TATCAAAAGA CCGTGACACA TCATATGCTT ACGTGTTAAT ATAGGTATTT 

640 650 660 670 680 690 700 

ATTGATATTA TATATTCGTC GCGACGAAAA TAAAG AC ATA ATATTCGGTA TACCATTTAT CCACGATATA 
TAACTATAAT ATATAAGCAG CGCTGCTTTT ATTTCTGTAT TATAAGCCAT ATGGTAAATA GGTGCTATAT 

710 720 730 740 750 760 770 

TCTAAATTCC ACTGATATAT CTAAATTCCA CTTGATCCCT TTTATGGATA AATTCTGGAT AACAATTACT 
AGATTTAAGG TGACTATATA GATTTAAGGT GAACTAGGGA AAATAC C TAT TTAAGACCTA TTGTTAATGA 

780 790 800 810 820 830 840 

ACCAGCAGTA TATCCTACTA TCAGCGCACT GCACACCAAA CTACCCTCAC CCAGTAGTTA CAAACGCATA 
TGGTGGTCAT ATAGGATGAT AGTCGCGTGA CGTGTGGTTT GATGGGAGTG GGTCATCAAT GTTTGCGTAT 
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850 860 870 880 890 900 910 

TTTTGCCGTT AGTTAATTAT TATCCGGTAA AGAAGGTAAA GAAGATTGGT AGTAATCCAA AATTTTCCCA 
AAAACGGCAA TCAATTAATA ATAGGCCATT TCTTCCATTT CTTCTAACCA TCATTAGGTT TTAAAAGGGT 

920 930 940 950 960 970 980 

ACCCCAACCT CGGAACAAAA ACCGCGTAGT ATTTGTCGTA ACCAGGAGCA TCCGAGTCAT TAATTTACAC 
TGGGGTTGGA GCCTTGTTTT TGGCGCATCA TAAACAGCAT TGGTCCTCGT AGGCTCAGTA ATTAAATGTG 

>Transcription_5tart__site 

>CAAG_site 

I I 
990 1000 1010 1020 1030 1040 1050 

CCAAACACAA AAAATTAGCA GCACGGAGCC GCCTTCCCAA TCCTCTCCTC TCTCCTCTCC TCTTCTCCAA 

GGTTTGTGTT TTTTAATCGT CGTGCGTCGG CGGAAGGGTT AG GAG AG GAG AGAGGAGAGG AGAAGAGGTT 

1060 1070 1080 1090 1100 1110 1120 

GCGGCAATTC GCGCGAGGTT TTCTCCGATC AAACCCTCGA ATCCCCCCCT CGCGAATCCA TCGGAGGGTA 
CGCCGTTAAG CGCGCTCCAA AAGAGGCTAG TTTGGGAGCT TAGGGGGGGA GCGCTTAGGT AGCCTCCCAT 

1130 1140 1150 1160 1170 1180 

GCCCCGCGAT CCGCGTCGGC GAGAGCGGAT TCCGATTCCG CG ATG GAG CGG GTG TTC TCC GTG 
CGGGGCGCTA GGCGCAGCCG CTCTCGCCTA AGGCTAAGGC GC TAC CTC GCC CAC AAG AGG CAC 

M E R V F S V> 
a a EXON 1 a a > 

1190 1200 1210 1220 1230 1240 
GAG GAG ATC TCC GAC CCA TTC TGG GTC CCG CCT CCG CCG CCG CAG TCG GCG GCG GCG 
CTC CTC TAG AGG CTG GGT AAG ACC CAG GGC GGA GGC GGC GGC GTC AGC CGC CGC CGC 
E E I S D P F W V P P P P P Q S A A A> 
a a a a a a a a EXON1 a a_ a a a a a a > 

1250 1260 1270 1280 1290 
GCC CAG CAG CAG GGC GGC GGC GGC GTG GCT TCG GGA GGT GGT GGT GGT GTA GCG GGG 
CGG GTC GTC GTC CCG CCG CCG CCG CAC CGA AGC CCT CCA CCA CCA CCA CAT CGC CCC 
A QQQGGG GVASG G G G G V A G> 
a a a a a a a__ a EXON1 a a a a _a a a a > 

1300 1310 1320 1330 1340 1350 
GGC GGC GGC GGC GGG AAC GCG ATG AAC CGG TGC CCG TCG GAG TGG TAC TTC CAG AAG 
CCG CCG CCG CCG CCC TTG CGC TAC TTG GCC ACG GGC AGC CTC ACC ATG AAG GTC TTC 
G G G G G N A M N R C P S E W Y F Q K> 
a_ a a a a a a a EXON1 a a a a a a a a > 

1360 1370 1380 1390 1400 1410 

TTT CTG GAG GAG GCG GTG CTC GAT AGC CCC GTC CCG AAC CCT AGC CCG AGG GCC GAA 
AAA GAC CTC CTC CGC CAC GAG CTA TCG GGG CAG GGC TTG GGA TCG GGC TCC CGG CTT 
F kEEAV L D S P V P N P S P R A E> 

a a a a a a a a EXON1 a a a a a a a a > 
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1420 1430 1^40 1450 1460 
GCG GGA GGG ATC AGG GGC GCA GGA GGG GTG GTG CCG GTC GAT GTT AAG CAG CCG CAG 
CGC CCT CCC TAG TCC CCG CGT CCT CCC CAC CAC GGC CAG CTA CAA TTC GTC GGC GTC 
A G G I RGAGG V V P V D V KQ P Q> 
a a_ a_ a a a a a EXON1 a a a a a a_ a_ a_ > 

1470 1480 1490 1500 1510 

1520 

CTC TCG GCG GCG GCG ACG ACG AGC GCG GTG GTG GAC CCC GTG GAG TAC AAC GCG ATG 
GAG AGC CGC CGC CGC TGC TGC TCG CGC CAC CAC CTG GGG CAC CTC ATG TTG CGC TAC 
LS AAATTSAVV DPVEYN A M> 
a a a a a a a a EXON1 a a a a a a a a _> 

1530 1540 1550 1560 1570 1580 
CTG AAG CAG AAG CTG GAG AAG GAC CTC GCC GCG GTC GCC ATG TGG AGG GTACAGC 
GAC TTC GTC TTC GAC CTC TTC CTG GAG CGG CGC CAG CGG TAC ACC TCC CATGTCG 
LKQKLEKDLAAVAMW R> 
a_ a a a a a EXON 1 a a a a a a a > 

1590 1600 1610 1620 1630 1640 1650 

CATTCTCCCC CCCTCTAGTA CTCGAGAGCT TAC TG AG ATC GGCAATGCTA GCTACTGTTT GCATCGAATG 
GTAAGAGGGG G GG AG ATC AT GAGCTCTCGA ATGACTCTAG CCGTTACGAT CGATGACAAA CGTAGCTTAC 

1660 1670 1680 1690 1700 1710 1720 

TTTATAGGTA TTTAGATCGG GCATTTCTAT AGACCAATGG CGTCCATGGT CTTGCAATGC GCTCTGTTGA 
AAATATCCAT AAATC TAGCC CGTAAAGATA TCTGGTTACC GCAGGTACCA G AAC GTT ACG CGAGACAACT 

1730 1740 1750 1760 1770 1780 1790 

GTGTCGGTGG TTGGTTCGAC TCATAGTATG TAGGGTTGTG CGTATGTACA AACGGAAGCT TCATAGACCT 
CACAGCCACC AACCAAGCTG AGTATCATAC ATCCCAACAC GCATACATGT TTGCCTTCGA AGTATCTGGA 

1800 1810 1820 1830 1840 1850 1860 

CGGTATTGAG ATTGCGATAT CGATGCAACC TGC GAATTGG CGATGTAATC AGTCATATTC TTACTAAACT 
GCCATAACTC TAACGCTATA GCTACGTTGG ACGCTTAACC GCTACATTAG TCAGTATAAG AATGATTTGA 

1870 1880 1890 1900 1910 1920 1930 

GCGAGACAGT GGTTTGTTTG CAATTGCAAT ATTTTTGTAT GGGGCTGCTT AAACTGTCAT TGCCTTTTTA 
CGCTCTGTCA CCAAACAAAC GTTAACGTTA TAAAAACATA CCCCGACGAA TTTGACAGTA ACGGAAAAAT 

1940 1950 1960 1970 1980 1990 2000 

GATTGGCAAT ATGTGACTTT ATGCAAGTAT TTGATTGGGC GGATCCAGGA ACAAAAAGTT GGGGGGATTC 
CTAACCGTTA TACACTGAAA TACGTTCATA AACTAACCCG CCTAGGTCCT TGTTTTTCAA CCCCCCTAAG 

2010 2020 2030 2040 2050 2060 2070 

AACATACCGA GTACACTGGC ATAAACACAT CATCTCAGTA TTAAACTATG CTAAAATGCT ATTAAGAGAC 
TTGTATGGCT CATGTGACCG TATTTGTGTA GTAGAGTCAT AATTTGATAC GATTTTACGA TAATTCTCTG 
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2080 2090 2100 2110 2120 2130 2140 

CTTTAGCACC TCTTATCTTA TCAACCATGG TGAAAAAATT GAAGGGGGGA CTCAGGGGGG TATCCATGGG 
GAAATCGTGG AGAATAGAAT AGTTGGTACC ACT TTTTT AA CTTCCCCCCT GAGTCCCCCC ATAGGTACCC 

2150 2160 2170 2180 ,2190 2200 2210 

TCCGATGGGT ■ GCAGGGGGGA CTGAGTCCCC CCTGCACCCA CGTTGAATCC GCCCTGGCAT GCGTATAAGC 
AGGCTACCCA CGTCCCCCCT GACTCAGGGG GGACGTGGGT GCAACTTAGG CGGGACCGTA CGCATATTCG 

2220 2230 2240 2250 2260 2270 2280 

TGTCACAGCC ATTTCTAGGT GCTTGTGCTT AG TTGGGTG A TGTCAGCTTA ATTTGTCTTT TCTATGTCGT 
ACAGTGTCGG TAAAGATCCA CGAACACGAA TCAACCCACT ACAGTCGAAT TAAACAGAAA AGATACAGCA 

2290 2300 2310 2320 2330 2340 2350 

CATCGATTTT CTAAGAAACG AAAAATAGCC TATTTATGTG CTCCAGAATT TGATGATCCC TGGCCCTTCA 
GTAGCTAAAA GATTCTTTGC TTTTTATCGG ATAAATACAC GAGGTCTTAA AC T AC T AGG G ACCGGGAAGT 

2360 2370 2380 2390 2400 2410 2420 

TTTGCTGAAA TTAGCCTATT TGTTGGTTGC CCTTCAGTTT TTTCCCAGCT TATGTTGTTG CAATGTGTGG 
AAACGACTTT AATCGGATAA ACAACCAACG GGAAGTCAAA AAAGGGTCGA ATACAACAAC GTTACACACC 

2430 2440 2450 2460 2470 2480 2490 

CTATGCCTCG TTTTGTGCCC TATAATTTAT TATTTGCAAT TCATTTTTGT ACATGACTTA AAATGAC AC T 
GATACGGAGC AAAACACGGG ATATTAAATA ATAAACGTTA AGTAAAAACA TGTACTGAAT TTTACTGTGA 

2500 2510 2520 2530 2540 2550 2560 

AGAGCAACAT GCACTGATTG GTTATCCTAT AATCATTTAT GTAGTTCTGT TCATTTTATC ATGCTAGCTC 
TCTCGTTGTA CGTGACTAAC CAATAGGATA TTAGTAAATA CATCAAGACA AGTAAAATAG TACGATCGAG 

2570 2580 2590 2600 2610 " 2620 

ATGTCATTTT CATCTTCAG GCC TCT GGC ACA GTT CCA CCT GAG CGT CCT GGA GCT GGT TCA 
TACAGTAAAA GTAGAAGTC CGG AGA CCG TGT CAA GGT GGA CTC GCA GGA CCT CGA CCA AGT 

A S G T V P P E R P G A G S> 
b b b b b EXON2 b b b b b b > 



2630 2640 2650 2660 2670 2680 
TCC TTG CTG AAT GCA GAT GTT TCA CAC ATA GGC GCT CCT AAT TCC ATC GGA GGTACTTA 
AGG AAC GAG TTA CGT CTA CAA AGT GTG TAT CCG CGA GGA TTA AGG TAG CCT CCATGAAT 
S LLNADV S H IGAP«NS I G> 
b b b b b b b EXON2 b b b b b b b > 



2690 2700 2710 2720 2730 2740 2750 

TCTTATCTGG TTACATTTTC AGATTGTTAT GAAACTACCC AAATATCCTG CACAATTGCA TGGGATTAAA 
AGAATAGACC AATGTAAAAG TCTAACAATA CTTTGATGGG TTTATAGGAC GTGTTAACGT ACCCTAATTT 



Fig. 42D 
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2760 2770 2780 2790 2800 2810 2820 

TTTTAGTTTC TTTGAAATAG AAGTAGAGTT GTATTGCTGT CACGTCATCA AATAGTTCTG AAGCTATGAA 
AAAATC AAAG AAACTTTATC TTCATCTCAA CATAACGACA GTGCAGTAGT TTATCAAGAC TTCGATACTT 

2830 2840 2850 2860 2870 2880 2890 

TAAATAAGTT CCGCATTTGT TAGTGATTCT TTGAACATTA GAATTGTTAT GCTTAAGTAG ATAGGGTTAT 
ATTTATTCAA GGCGTAAACA ATCACTAAGA AACTTGTAAT CTTAACAATA CGAATTCATC TATCCCAATA 

2900 2910 2920 2930 2940 2950 2960 

GTTTGTTTGG AGTTCCCTTA AATCATTTCA TTGCTGACTG CCAGCTGGCA GGAGCATTTG TTGTTGCCTT 
CAAACAAACC TCAAGGGAAT TTAGTAAAGT AACGACTGAC GGTCGACCGT CCTCGTAAAC AACAACGGAA 

2970 2980 2990 3000 3010 3020 3030 

GACCATGAAT GAAGACCTTC CTGTTCTGAG TGCTCACAAG AAAACATATT TTGATTAATG CACCTTGAAT 
CTGGTACTTA CTTCTGGAAG GACAAGACTC ACGAGTGTTC TTTTGTATAA AACTAATTAC GTGGAACTTA 

3040 3050 3060 3070 3080 3090 3100 

CCTTAGGATC TTGCAAAGAT GGGCACTTAG CTTTAGAATT GAGTAGTACT TAAATAGCTG TTGTTATCAT 
GGAATCCTAG AACGTTTCTA CCCGTGAATC GAAATCTTAA CTCATCATGA ATTTATCGAC AACAATAGTA 

3110 3120 3130 3140 3150 3160 3170 

GATTTGTCCT GTAGTGAAAT GTCGACAAAA CAGGAATGCT ACTTTTGACT TCTGATATTT CATGCCTGGC 
CTAAACAGGA CATCACTTTA CAGCTGTTTT GTCCTTACGA TGAAAACTGA AGACTATAAA GTACGGACCG 

3180 3190 3200 3210 3220 3230 

TTTACTTATG CTCTGTTTGG AACATGGGCA CATATCA GGC AAT GCT ACT CCA GTT CAA AAC ATG 
AAATGAATAC GAGACAAACC TTGTACCCGT GTATAGT CCG TTA CGA TGA GGT CAA GTT TTG TAC 

G N A T P V Q N M> 
C C C EXON3 C C C > 



3240 3250 3260 3270 3280 3290 
CTA AGT GGC CCA AGT GGG GGA TCG GGC TCA CAG TTG GTA CAG AAT GTT GAT GTC CTT 
GAT TCA CCG GGT TCA CCC CCT AGC CCG AGT GTC AAC CAT GTC TTA CAA CTA CAG GAA 
L S G P S G G S G S Q L V Q N V D V L> 
c c c c c c c _c EX ON 3 c c c c c c c c > 

3300 3310 3320 3330 3340 
GTA AAG CAG CCC ACC AGC TCT TCA TCA AGG GAG CAG TCA GAT GAT GAT GAC ATG AAG 
CAT TTC GTC GGG TGG TCG AGA AGT AGT TCC CTC GTC AGT CTA CTA CTA CTG TAC TTC 
VKQPT S S SSREQSDDDDM K> 
c c C c C c C c EXON 3 c c c c c c c c > 

3350 3360 3370 3380 3390 3400 
GGA GAA GCT GAG ACC ACT GGA ACT GCA AGA CCT GCT GAT CAA AGA TTA CAA CGA 
CCT CTT CGA CTC TGG TGA CCT TGA CGT TCT GGA CGA CTA GTT TCT AAT GTT GCT 
GEAET. TGTARPA"DQRLQR> 
c c c c c c c EXON3 __c c c c c c c c > 
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3410 • 3420 3430 3440 3450 3460 3470 

AGGTGATC ATTCATTGCT TCCTTGTAAT ATAGATTCTG TACATAATTA ACCTACCTCG TCATGCATGC 
TCCACTAG TAAGTAACGA AGGAACATTA TATCTAAGAC ATGTATTAAT TGGATGGAGC AGTACGTACG 

3480 3490 3500 3510 3520 3530 3540 

ATGTGTCCTA TTTTCACCTT AGCCCTTTCA GTTGGATTTC CACTTTCATC CGGTAGCCTT TCAGTTTCCT 
TACACAGGAT AAAAGTGGAA TCGGGAAAGT CAACCTAAAG GTGAAAGTAG GCCATCGGAA AGTCAAAGGA 

3550 3560 3570 3580 3590 3600 3610 

ATTGCATCGC ATATATGATC TTTTACCTAC CATATTAGTT CTCTGTGTGC CATACTCAGT GCTTAGTGTC 
TAACGTAGCG TAT AT AC TAG AAAATGGATG GTATAATCAA GAGACACACG GTATGAGTCA CGAATCACAG 

3620 3630 3640 3650 3660 3670 3680 

TCGAGCAAGA GAGGAATTTG TATGGCTATT ACACGTAGCA CTTTGCTCTC TACTTGTTTA TTGACATAAG 
AGCTCGTTCT CTCCTTAAAC ATACCGATAA TGTGCATCGT GAAACGAGAG ATGAACAAAT AACTGTATTC 

3690 3700 371.0 3720 3730 3740 3750 

CAATTTGGGA TGAATTAAAT CTGAGTTCAC ATCATATTCC TTATGTCACA AGTTTCTGAA ACCGATTGTA 
GTTAAACCCT ACTTAATTTA GACTCAAGTG TAGTATAAGG AATACAGTGT TCAAAGACTT TGGCTAACAT 

3760 3770 3780 3790 3800 3810 3820 

TCTAGTATCT GGTTGATGCA CCCCCATGTT GGATTTGCAA ATCAAAGTTA TACTCCCTAG AGAGCTTTAC 
AGATCATAGA CCAACTACGT GGGGGTAGAA CCTAAACGTT TAGTTTCAAT ATGAGGGATC TCTCGAAATG 

3830 3840 3850 3860 3870 3880 3890 

CTTTC AT AAA GCAATTACCC CAATAAACCA CGGATTTGAT AGC TATTG AC TATGATTACC AGAATTCATT 
GAAAGTATTT CGTTAATGGG GTTATTTGGT GCCTAAACTA TCGATAACTG AT AC TAATGG TCTTAAGTAA 

3900 3910 3920 3930 3940 3950 3960 

TGGCAGCTAT TTTCTCAATT TAAGTTTGGT ATTAGTCTCA GTTGGCTGTA AAATAATGTC ACGGTAGGGT 
ACCGTCGATA AAAGAGTTAA ATTCAAACCA TAATCAGAGT CAACCGACAT TTTATTACAG TGCCATCCCA 

3970 3980 3990 4000 4010 4020 4030 

ACATGTATGT GCAGCATACA AGGTATGGGT GAGTTATGAT ATGGACAGTG TGTACACCCC ACATTTGCTC 
TGTACATACA CGTCGTATGT TCCATACCCA CTCAATACTA TACCTGTCAC ACATGTGGGG TGTAAACGAG 

4O40 4050 4060 4070 4080 4090 4100 

ACTAAAATCA AAATATTCAA ACGTCACGTG ATGATATGGT GGATTGCATT ATACCTTGTA TTGTTTATTA 
TGATTTTAGT TTTATAAGTT TGCAGTGC AC TACTATACCA CCTAACGTAA TATGGAACAT AACAAATAAT 

4110 4120 4130 4140 4 150 4160 4170 

TGTTACTTGT GCTAGACAAT AATATAGGCT GTTCTTTTGG GTGATTTTGT ATGAAGATGT TGAGCAAGCA 
ACAATGAACA CGATCTGTTA TTATATCCGA CAAGAAAACC CACTAAAACA T AC TTCTAC A ACTCGTTCGT 

4180 4190 4200 4210 4220 4230 

CTTCTCGATA TAATGCTAGT TTTGTTGACC TGTTCC AGG AAG CAA TCC AAT CGG GAG TCA GCC 
GAAGAGCTAT ATTACGATCA AAACAACTGG ACAAGG TCC TTC GTT AGG TTA GCC CTC AGT CGG 

RKQSNRES A> 
d d _d EXON4 d d _d > 
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4240 4250 4260 4270 4280 4290 
AGG CGC TCA AGA AGC AGA AAG GCA GCT CAC TTG AAT GAG CTG GAG GCA CAG GTGTGA 
TCC GCG AGT TCT TCG TCT TTC CGT CGA GTG AAC TTA CTC GAC CTC CGT GTC CACACT 
R R S R S R K A A H L N E L E A Q> 
d_ d d d_ d d _d EXON4 _d d d d d d_ d. _> 



4300 4310 4320 4330 4340 4350 4360 

TAGTTCACAT AGTTATTTTC GATAAGACAT AAAATCCTAA ATTACTGGCT ACTGACTTCA GTTATGGATT 
ATCAAGTGTA TCAATAAAAG CTATTCTGTA TTTTAGGATT TAATGACCGA TG AC TG AAG T CAATACCTAA 

4370 4380 4390 4400 4410 4420 
TAC TTGTTAC AG GTA TCG CAA TTA AGA GTC GAG AAC TCC TCG CTG TTA AGG CGT CTT GCT 
ATGAACAATG TC CAT AGC GTT AAT TCT CAG CTC TTG AGG AGC GAC AAT TCC GCA GAA CGA 
V S Q L R V E N S S L L R R L A> 
e e e e e e EXON5 e e e e e e e > 



4430 4440 4450 4460 4470 
GAT GTT AAC CAG AAG TAC AAT GAT GCT GCT GTT GAC AAT AGA GTG CTA AAA GCA GAT 
CTA CAA TTG GTC TTC ATG TTA CTA CGA CGA CAA CTG TTA TCT CAC GAT TTT CGT CTA 
D V N Q K Y N D A A V D N R V L K A D> 
e e e e e e__ e e EXON5 e e_ e e e e e e > 



4480 4490 4500 4510 4520 4530 4540 

GTT GAG ACC TTG AGA GCA AAG GT ATG C TAT AT A TGCCTTTTGC AATATGCATC CCATGGATTG 
CAA CTC TGG AAC TCT CGT TTC CA TAC G ATATAT ACGGAAAACG TTATACGTAG GGTACCTAAC 
V E T L R A K> 
e e EXON5 e e > 



4550 4560 4570 4580 4590 4600 4610 

CTACTTTGGC TTGTTTCAAA CTTTCAACGT GACTTGTGTA CCCTGTTATT AGAAGAATAA TCCCGCCTAC 
GATGAAACCG AACAAAGTTT GAAAGTTGCA CTGAACACAT GGGACAATAA TCTTCTTATT AGGGCGGATG 

4620 4630 4640 4650 4660 4670 4680 

CATTATACTC TATAAATCAC CATTTGGCCA GTCCAAACAT GATTATTAAA TCAGGTCAAT CTGAACATTG 
GTAATATGAG ATATTTAGTG GTAAAC CGGT CAGGTTTGTA CTAATAATTT AGTCCAGTTA GACTTGTAAC 

4690 4700 4710 4720 4730 4740 

AAATGTATCA AAAATTCGCA GGTG AAG ATG GCA GAG GAC TCG GTG AAG CGG GTG ACA GGC 
TTTACATAGT TTTTAAGCGT CCAC TTC TAC CGT CTC CTG AGC CAC TTC GCC CAC TGT CCG 

K M A E D S V K R V T G> 
f f f f EXON6 £ f J. f f > 



4750 4760 4770 4780 4790 
ATG AAC GCG TTG TTT CCC GCC GCT TCT GAT ATG TCA TCC CTC AGC ATG CCA TTC AAC 
TAC TTG CGC AAC AAA GGG CGG CGA AGA CTA TAC AGT AGG GAG TCG TAC GGT AAG TTG 
MNA L FPA AS D M S S L S MP F N> 
f f f f f f f f EXON6 t f f f f f £ f > 
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4800 4810 4820 4830 4840 4850 
AGC TCC CCA TCT GAA GCA ACG TCA GAC GCT GCT GTT CCC ATC CAA GAT GAC CCG AAC 
TCG AGG GGT AGA CTT CGT TGC AGT CTG CGA CGA CAA GGG TAG GTT CTA CTG GGC TTG 
S S P S E A T S D A A V P I QDDP N> 
f f f f f f_ f f EXON6_ f f f f f f f f > 

4860 4870 4880 4890 4900 4910 

AAT TAC TTC GCT ACT AAC AAC GAC ATC GGA GGT AAC AAC AAC . TAC ATG CCC GAC ATA 
TTA ATG AAG CGA TGA TTG TTG CTG TAG CCT CCA TTG TTG TTG ATG TAC GGG CTG TAT 
N y FAT N N D I G G N N N Y M P D I> 

f f f f f f f f EXON6 f f f f f„ f f f. > 



4920 4930 4940 4950 4960 
CCT TCT TCG GCT CAG GAG GAC GAG GAC TTC GTC AAT GGC GCT CTG GCT GCC GGC AAG 
GGA AGA AGC CGA GTC CTC CTG CTC CTG AAG CAG TTA CCG CGA GAC CGA CGG CCG TTC 
p s SAQEDE D FV N GA L A A G K> 
_f £ f f f f f f EXON6 f f f f f f f f > 

4970 4980 4990 5000 5010 

5020 

ATT GGC CGG CCA GCC TCG CTG CAG CGG GTG GCG AGC CTG GAG CAT CTC CAG AAG AGG 
TAA CCG GCC GGT CGG AGC GAC GTC GCC CAC CGC TCG GAC CTC GTA GAG GTC TTC TCC 
I G R P A S L Q R V A S L E H L Q K R> 
___f f f f f f f f EXON6 f f f f f f f f > 

5030 5040 5050 5060 5070 5080 

ATG TGC GGT GGG CCG GCT TCG TCT GGG TCG ACG TCC TGA GAC CGA AACCCAGAGC 
TAC ACG CCA CCC GGC CGA AGC AGA CCC AGC TGC AGG ACT CTGGCT TTGGGTCTCG 
M CGGPASSGSTS *> 
_f f _f f f EXON6 f f f f f > 

5090 5100 5110 5120 5130 5140 5150 

TGCTTCGGTT CTGAAAGACA C TGC GAG CAG GAAATGATGA TTGGACAGGC GTAGACATTG CTAATGCTGT 
ACGAAGCCAA GACTTTCTGT GACGCTCGTC CTTTACTACT AACCTGTCCG CATCTGTAAC GATTACGACA 

5160 5170 5180 5190 5200 5210 5220 

GAGGTTGATG ATTGTTGGTC GTCGTCGTCG TCATTGTGCA TTCTTTGTAA GGGACACCTC TTAGTACCCT 
CTCCAACTAC TAACAACCAG CAGCAGCAGC AGTAACACGT AAGAAACATT CCCTGTGGAG AATCATGGGA 

5230 5240 5250 5260 5270 5280 5290 

CTTCTTCTAA GGGACTTAGT ACCCCTTGTG GATCTCATCG TCC TAAATAC TATACACATT AGCCAAATGT 
GAAGAAGATT CCCTGAATCA TGGGGAACAC CTAGAGTAGC AGGATTTATG ATATGTGTAA TCGGTTTACA 



>terminator 

I 

5300 5310 5320 5330 5340 5350 5360 

TCATTGGTGT GATGGCGTCG TCCCTAATTT GAACGACTGA TTTCAGGCAG CTGCTATGCT ATCATTCAAT 
AGTAACCACA CTACCGCAGC AGGGATTAAA CTTGCTGACT AAAGTCCGTC GACGATACGA TAGTAAGTTA 
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5370 5380 5390 5400 5410 5420 5430 

AATATTTTGA TCGATGCTTC CTCTTGTCTT TTGCTCTTAA GCAACCAAGC ATAAAGATAT CACTACCTTT 
TTATAAAACT AGCTACGAAG GAGAACAGAA AACGAGAATT CGTTGGTTCG TATTTCTATA GTGATGGAAA 

5440 5450 5460 5470 5480 5490 5500 

TGAGCTGTTC ATTTGAAGTG CAAAGCTAAG GTCAATATCT CAGGTGTTCA TTTGAAGTTT AAAGGTGAAC 
ACTCGACAAG TAAACTTCAC GTTTCGATTC GAGTTATAGA GTCCACAAGT AAAC TTC AAA TTTCCACTTG 

5510 5520 5530 5540 5550 5560 5570 

TGATAACAAA CGTCAGGCTA TGGTGAATGA AGGGACGTGT ACATCCCTAA TACATGTCAT TTTCATAATC 
ACTATTGTTT GCAGTCCGAT ACCACTTACT TGCCTGCACA TGTAGGGATT ATGTACAGTA AAAGTATTAG 

5580 5590 5600 5610 5620 5630 5640 

AAATTAGTTG ATGC ATTTTC ACCCAGAATC CCATCACAGT TCATCATACA AGCAAGTGTA GTTATTAATG 
TTTAATCAAC TACGTAAAAG TGGGTCTTAG GGTAGTGTCA AGTAGTATGT TCGTTCACAT CAATAATTAC 

5650 5660 5670 5680 5690 5700 5710 

GTAAATTTTT CGTTTAGAGA AAAAAAAAGG AAGCCTTATA TAAGATTCAC CGGTGGGGTG TGAACAATAA 
CATTTAAAAA GGAAATCTCT TTTTTTTTCC TTCGGAATAT ATTC TAAGTG GCCACCCCAC ACTTGTTATT 

5720 5730 5740 5750 5760 5770 5780 

TCAATGAATG AGATCGCATC CCGTAAGGGC AGCCTAGCTA GACAAAAATG CATAAAACTC GGTATACCAA 
AGTTACTTAC TCTAGCGTAG GGCATTCCCG TCGGATCGAT CTGTTTTTAC GTATTTTGAG GCATATGGTT 

5790 5800 5810 5820 5830 5840 5850 

CCACAACAAC GCTTGCGCAC GCGCTCAAAT GGCAGCGACT TCATCGCTTT CGCGGGCAAG AAAC GAATC A 
GGTGTTG TTG CGAACGCGTG CGCGAGTTTA CCGTCGCTGA AGTAGCGAAA GCGCCCGTTC TTTGCTTAGT 

5860 5870 5880 5890 5900 5910 5920 

AGTGATACAT TGGCAGGGAA CCACCAAAAG AAGGCCATCC AATCCAATCC ACTCCAACGC GGCATGGAAG 
TCACTATGTA ACCGTCCCTT GGTGGTTTTC TTCCGGTAGG TTAGGTTAGG TGAGGTTGCG CCGTACCTTC 

5930 5940 5950 5960 5970 5980 5990 

ACAAGACAGA TGATTCACAG CTATCTTCTG CTTCTACAAG TTTGATACTT TGTACTGTCC TTTCAGGGAA 
TGTTCTGTCT AC TAAGTG TC GATAGAAGAC GAAGATGTTC AAACTATGAA ACATGACAGG AAAGTCCCTT 

6000 6010 6020 6030 6040 6050 6060 

AAAAGAGCAT CAGATTAGTC TGATCTCGGG CGCGTTGAGT TCTTGTGGGA GATC TTGTTG TGG AG TGGC A 
TTTTCTCGTA GTCTAATCAG ACTAGAGCCC GCGCAACTCA AGAACACCCT CTAGAACAAC ACCTCACCGT 

6070 6080 6090 6100 6110 6120 6130 

GGAGTGACGA TCGGCTGCCC CGTTTTCTTC TACCGAAACA TCGCCAGTAA AGAAGCCAAA AAGACAATAA 
CCTCACTGCT AGCCGACGGG GCAAAAGAAG ATGGCTTTGT AGCGGTCATT TCTTCGGTTT TTCTGTTATT 

6140 6150 6160 6170 6180 6190 6200 

TACGGCAATG GGGATCGCCC ATCTGCATAA AACATTGCAT GACGGAACTG ATTAATACAA GAATGACATG 
ATGCCGTTAC CCCTAGCGGG TAGACGTATT TTGTAACGTA CTGCCTTGAC TAATTATGTT CTTACTGTAC 

6210 6220 
TAAGCTGATA ATTACGCGTG CAAGCTT 
ATTCGACTAT TAATGGGCAC GTTCGAA 
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Promoter Gene Terminator 

1 Glb-GUS L L TT > t >LLLllLlj 



Apromoler-Reb fcj I ■ j^t^^i. 

Gib-Gus i c — >mnm 



3 Glb-Reb jj I — ^ r^^i 
Glb-GUS 1 1 —i ^ >rmm3 



Gib-Gus rr : i — >nrmn 



5 Native-Reb <fllllllllHlIIHIII}|lilll]IIimt C>C3ZZI3 

Gib-Gus k ia» zr>[miiD 

Fig* 45A 



x 



.*;■.*;•.•;-.• 
. ■ *. • *. • • 

. ■*•.-•.■**.■> 

- :WV vv-: - 
A;A/A/ 

;A;A;.-\\ 



- . •» .■ ** *• 
'.•*.*.*.•••*/-■ 

.•*•.•*•.••».• 
.*.*.•/.•*.■.• 

^v-.v-.v 

• v -1 
.* « .* - • .* 




x 




*VVA/. 
/*vvv./.« 
/A/A;/. 

,*.*.*.**•.** 
..'.*;•"/"•.*•• 

*«•*.• *.•«. 

AV-.V-.V 
• 

'.■■■.•«*.•••. 
.*.*-*•■ 

/A/.-y.v/ 

•.*•.*■ •*■• * 

:^V.;A^ 

■vvvV-vv 



Fig. 45B 



020647 50A2 i > 



WO 02/064750 



PCT/US02/04909 



1 Gt1-GUS 



^romo7eP 



Gene 



Terminator 



muni 



Gt1-GUS 
Native-Reb 





>fim 


mil 


<ainnflfflBiiffliBnnnB™"iii" 







3 GU+UAS-GUS 




UAS (200 bp from Gib promoter) 
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Tcjvccrccrcrr<u>xiTc^ TTCT 

&tYCfti26A ,••••••♦•-«•••••---•'•-•**-»'»-***•»«♦•**♦*'• CCW^6GCACCTTOC*T<^CCTT* *TA 

PCR-CM2S JUJuoiU^TCC*^ACAC^^ 

BLYChi26A **A- A* A "A T ,-„*«*« 

f»Cfc-Chi26 TA- TTCATTACC AATCACTC JU^TTTTCri^T^^ 

BLYCK126A »•*••• -« , . • 

PCX -Ch 12 6 T — , OGCCATreACAAATACA* MnwrC AT^ 

&LYChl26X -rTCGOCCC>JkTXACAAAA>XXCAA A'C^ * * * ♦ * - — 

PCS-Chi26 TC^uarTAAAATC^TOCAAArTT^ 

8LrCKi26A „. A », * G «w* A ««« + - , . 

PCK-Ch\26 AJUrrrrTri^GCATt^A<5AAATATAAT^^ 

BLYChi_2&* C** -* — c .«.-.- „_„ c ..„*„ 

POR-CK126 TTTTCCC ju*CTCTAGCTTGTCCC1^^ 

BLYChl56A * ? wwc* — 

J>CR-Ch 1 2 6 CTTCTTTra-TCOCrCXTA^ 

eVYCni26A ••*-C 

?>CR-Cll±2 « AC AC - »AC<ftCGTCCAraCAC^^ 

8LYChA26A "*-*CT w«..- c -. — 

PC*-Chi2£ rr^cCTC^Tl^Ct^^ 

BLVO^L2€A " *T -«C~ — • —«<••«•<•• w-, 

8LYChi26A A'*'*** — w— • — • — 

KTR-*Chi26 aacXZATCCATCGA<^TA<xrrACCXXX^C>^ 

eJUYCT\126A • * 

PCR-Chi2 e Aj^AuCACACACACAC^ 

BJLVCM26A -«••-• * 

*>Cfc-C*i26 IXTTtCCCTCGCACA 



HUYCht26A 
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PCft-Lptl 

blyldcx 

PCK-Lptl 

BLYLptl 

PCR-LpCl 
BLYTLpcX 

PCR-X-PCl 

8L.YL.pt: 1 

BLVLptl 

BLYXptl 

PCK-L.pt 1 
BT.YX.pt; 2 

BLYLptl 

aLrLpci 

PCR-"LpCl 

PCR-CPCX 
BLYl-ptl 



TrCGAA CAATCTCCACCACTT AATTGGA<^C^^ 

c^rrcGA<^?AAec*rauukTcc^^ 

GCTTTaO.CTATAATOaTTC^CCTC^^ 
ATATOTACTA/SGATACCaAC^CX^^ 

aactastaaaaaaactaaa^attaaaa^ 

AATATAATAATATTTTTTOTU^^ 

TrrA6A-recA<*2AACTACj>^^ 

ACACtrnzXTTCrrAAATAJkAACTATCA 

TTCTTATCIO^ACACCWCCTAAC^ 

CTAGC^CriK^TCAACAC^ 

GATACAATCTQTTCTCCACrrAA^ 

CAA453^ A7-TTAAC CCCC^^ 

AGTACCTTCTACTXTTTA^rrACACAr^ 
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3056 bp 
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1 GAGCCTCCTAOGCTA GAAGAAATCCTAACC CCACC^J^TCTGA AGCAOGAAACAAATC AATTAAAGAAGCCAT 
i 6 AAATGATAAATGAAT AATCCTTCATGCATA AAAATATTTAGTTAT CCATAAAGAGACGAC AGACGTATGCAACC7 
151 TCAT^ri'AATTGTGA CTCTACAGTCTCACC GAGTTGTCGGTCCAA GCAAtTTTACCCATA CATGAAAACCATCTO 
226 AGAAAGGGCATGAGA GTAGTACTGCGTCAA TACACATGCAGTXGC ACACftGTAGCTGAGA QArTTOAfiCCGGAAAC 
301 ATCTGGTCGCCSGTC CACGATCGGTCtAGG CATGGGCTTCCCCCC S^tO^ft^^ 

37 6 GTCCACATGCATGCA CGGTG^TCGACATCC CGATCGACGTGTCGA (^TGCAGCCACCCC GCCGCTCGCTCGCTC 
4 51 GCGCTCGCGAGTCGC GCTAGCACGGACATA CGGAGTAGTATATGC GTATGCATAGGCACG CGCGCGCACGCGCGA 
526 1XXAGCGAACCAGAG Cf^fflTX^CgaCSflgfi CGGAGGCATAGCTTC CGCACGTACGTGTC C CGCTCCGGCQ^CX 
601 fiCCACGACGCGTGTC GCGGAGGCGCGCGCG GgCfiCCGAGCCGTCC GCGTCCKXGCSQg*g AggTOC GTTGCCGGC 
67 5 CCCCTGATCCCGCCG CGGAGTCGCCGCCTA l^&^GGAO&CGC fi^GCGTCGTCOaX £CCCATCAGCTCAAG 
7 51 CC GCAGAAG ACATAC ACACACAAACACAAG CCACCCTTCCGATTT GTTCATCGATCAGTT CGCAGCGTACGTCAG 

826 GCTAGCTAACTAGTG TTTGGCA ATG GCG TCC GGG CAG CA£ CAG CAG GGC AGG TCG GAG CXG GAC 

MASGQQQOGRSELD 

890 CGC ATG GCC AGG GAG GGC CAG ACC GTC GTC CCC GGC GGC ACC GGC GGC AAG AGO CTC GAG 
RMARE GOTVV P G G T G G K S L E- 

950 GCC CAG GAG AAC CTC GCC GAG G 1 GTATGCAAAAXAATA CTGAAACTTTTGATA GATCAAACTTGCCAT 
A 0 E N L A E 

1C1"? TTCATTCGGTATTTC GGTTGtfy\GTACGTAA CTAATATGCGTACGT GTGCATCGATCGATC AG 1 GG CGC AGC 

G R S 

1087 CGC GGG GGG CAG ACG AGO AAG GAG CAG ATG GGG GAG GAA GGG TAC CGC GAG ATG GGG CGC 
RGGQTRKEOMGESG VREMGK 

1147 AAG GGC GGC CTC AGC ACC GGC GAC GAG TCC GGC GGC GAG CGC GCC GCC CGC GAG GGC ATC 
KG GL S TGDES GGERAAREGI 

1207 GAC ATC GAC GAG TCC AAG TAC AAG ACC AAG TCC TAG ACTACACACACTTTT GCATCCGTGAATG 
DXDESK Y KTKS Stop 

1271 CCAGTGTOTCGTAGT CGTlXrTCAGTTGTAGT OVTCX^GraXGTCTT AGCTAGCTAGCTCTC Tt&TCWTftATAATG 

1346 TAGGTCTTGACGGAX G«CGC^TTTA<30Cd; OXGTATGATTTGCr ATGTTATGTWCATA TGGTGATGCGTGCTT 

1421 AGCTTTAGCTAGCrT TGGTAGTTTGG1T1A GGTGCTGGTCAGTAG CTAGGGTATGTCTTT TAGTTmTTGCAGGA 

1496 TCGGTCAGGCAGGGG CTGAATGCTAGCTAG GTGATGCTTTGGCXT TTCATGGCCAGCAGC TGCTAGCTTCTTGTT 

1571 TCXAACTGATGCCTC TGCTGGGTGTGTCCT TTCXhfiCCGT^^ GCTAGCTX3TTGTTCA 

1646 TCOGAASGTGCCAGT CttCTATGTCTGATCT ATCGC<3K5CTAATTCT TGCTGTTGCTAGTTG ATCTTTGATTCCTCA 

1721 GAGATGOCATGTGCA GGATOOAATTATTAG CGCAAAACACAGGAG AaGGGACAACAATTT TAAGTTGCCTAATTT 

1796 TTTTCTTAAAACACC TGOCt3GGCX?3TGCAC ACAAAAAAAAAATCA AGTACCTC3TCTAAA CTGAATGCTCAAGTT 

1671 TGAAAGTGAAAACAG GACGTGCCGTGOCAA GAACTTCGAGATATC TCCGAGTTCT 



Fig. 57 



BNSDOC1D: <WO 0206475OA2_L> 



WO 02/064750 



PCT/US02/04909 



74/75 




Fig. 58A Fig. 58D 
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SEQUENCE LISTING 

<110> Ventria Bioscience 

<120> -Expression System for Seed Proteins 

<130> 50665-8026. WO00 

<140> Not Yet Assigned 
<141> Filed Herewith 

<150> US 60/269, 188 
<151> 2001-02-14 

<150> US 60/269,199 
<151> 2001-02-14 

<150> US 09/847,232 
<151> 2001-05-02 

<160> 33 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 393 

<212> DNA 

<213> Homo sapiens 



<400> 1 

aaagtcttcg agcggtgcga g'ctggcccgc acgctcaagc ggctcggcat ggacggctac 60 

cggggcatca gcctcgccaa ctggatgtgc ctcgccaagt gggagtcggg ctacaacacc 120 

cgcgcaacca actacaacgc cggcgaccgc tccaccgact acggcatctt ccagatcaac 180 

tcccgctact ggtgcaacga cggcaagacg cccggggccg tcaacgcctg ccacctctcc 240 

tgctcggccc tgctgcaaga caacatcgcc gacgccgtcg cg.tgcgcgaa gcgcgtcgtc 300 

cgcgacccgc agggcatccg ggcctgggtg gcctggcgca accgctgcca gaaccgggac 360 

gtgcgccagt acgtccaggg ctgcggcgtc tga 393 



<210> 2 

<211> 130 

<212> PRT 

<213> Homo sapiens 

<400> 2 



Lys 


Val 


Phe 


Glu 


Arg 


Cys 


Glu 


Leu 


Ala 


Arg 


Thr 


Leu 


Lys 


Arg 


Leu 


Gly 


1 








5 










10 










15 




Met 


Asp 


Gly 


Tyr 

20 


Arg 


Gly 


lie 


Ser 


Leu 
2 5 


Ala 


Asn 


Trp 


Met 


cys 

30 


Leu 


Ala 


Lys 


Trp 


GlU 
35 


Ser 


Gly 


Tyr 


Asn 


Thr 
40 


Arg 


Ala 


Thr 


Asn 


Tyr 
45 


Asn 


Ala 


Gly 


Asp 


Arg 
50 


Ser 


Thr 


Asp 


Tyr 


Gly 
55 


lie 


Phe 


Gin 


He 


Asn 
60 


Ser 


Arg 


Tyr 


Trp 


Cys 


Asn 


Asp 


Gly 


Lys 


Thr 


Pro 


Gly 


Ala 


Val 


Asn 


Ala 


Cys 


His 


Leu 


Ser 


65 










70 










75 










8 0 


Cys 


Ser 


Ala 


Leu 


Leu 


Gin 


Asp 


Asn 


lie 


Ala 


Asp 


Ala 


Val 


Ala 


cys 


Ala 








85 










90 










95 




Lys 


Arg 


Val 


Val 
100 


Arg 


Asp 


Pro 


Gin 


Gly 
105 


He 


Arg 


Ala 


Trp 


Val 
110 


Ala 


Trp 


Arg 


Asn 


Arg 


Cys 


Gin 


Asn 


Arg 


Asp 


Val 


Arg 


Gin 


Tyr 


Val 


Gin 


Gly 


Cys 



115 120 125 



Gly Val 
130 
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<211> 2079 

<212> DNA 

<213> Homo sapiens 



<400> 3 

gggcggcggc ggcgctcggt gcagtggtgc gccgtgtccc agcccgaggc gaccaagtgc 60 

ttccagtggc agcgcaacat gcggaaggtg cgcggcccgc cggtcagctg catcaagcgg 120 

gactccccca tccaatgcat ccaggccatc gcggagaacc gcgccgacgc ggtcaccctg 180 

gacggcgggt tcatctacga ggcggggctc gccccgtaca agctccgccc ggtggcggcg 240 

gaggtgtacg gcaccgagcg ccagccgcgc acgcactact acgcggtggc cgtcgtcaag 300 

aagggcgggt ccttccagct caacgagctg cagggcctga agtcgtgcca cacgggcctc 360 

cggcggacgg cgggctggaa cgtgcccatc g.gcaccctgc gccccttcct gaactggacc 420 

ggcccgccgg agccgatcga ggccgccgtg gcccgcttot tcagcgcctc ctgcgtcccc 480 

ggcgccgaca agggccagtt cccgaacctc tgccggctct gcgccgggac gggcgagaac 540 

aagtgcgcct tctcctcgca ggagccgtac ttctcctact cgggcgcgtt caagtgcctc 600 

cgcgacgggg ccggcgacgt ggcgttcatc cgcgagtcca ccgtgttcga ggacctctcc 660 

gacgaggcgg agcgggacga gtacgagctg ctgtgccccg acaacacccg caagccggtg 720 

gacaagttca aggactgcca cctggcgcgg gtgccctcgc acgcggtcgt cgcccgcagc 780 

gtcaacggca aggaggacgc gatctggaac ctcctccgcc aggcccagga gaagttcggc 840 

aaggacaagt cccccaagtt ccagctcttc gggagcccca gcggccagaa ggacctcctc 900 

ttcaaggact ccgcgatcgg cttctcccgc gtccccccgc gcatcgactc cggcctgtac 960 

ctcggctccg ggtacttcac cgcgatccag aacctccgga agagcgagga ggaggtggcg 1020 

gcgcggcggg cccgcgtcgt gtggtgcgcc gtgggcgagc aggagctgcg gaagtgcaac 1080 

cagtggagcg gcctgagcga ggggtcggtg acctgctcgt ccgccagcac caccgaggac 1140 

tgcatcgcgc tcgtcctcaa gggggaggcc gacgcgatga gcctcgacgg ggggtacgtc 1200 

tacaccgccg gcaagtgcgg cctggtcccg gtcctggcgg agaactacaa gtcgcagcag 1260 

tccagcgacc ccgacccgaa ctgcgtggac cgccccgtcg agggctacct cgccgtggcc 1320 

gtcgtgcgcc ggtccgacac ctccctgacg tggaacagcg tcaagggcaa gaagagctgc 1380 

cacaccgccg tggaccgcac cgccggctgg aacatcccga tgggcctcct cttcaaccag 1440 

accggctcct gcaagttcga cgagtagttc tcccagtcct gcgcccccgg ctcggacccc 1500 

cgctccaacc tgtgcgccct ctgcatcggg gacgagcagg gcgagaacaa gtgcgtgccc 1560 

aacagcaacg agcggtacta cggctacacg ggggccttcc gctgcctggc ggagaacgcc 1620 

ggggacgtcg cgttcgtgaa ggacgtgacc gtgctgcaaa acacggacgg gaacaacaac 1680 

gaggcgtggg cgaaggacct caagctcgcc gacttcgccc tgctgtgcct cgacggcaag 1740 

cgcaagcccg tcaccgaggc gcggtcctgc cacctggcga tggcccccaa ccacgccgtc 1800 

gtctcccgca tggacaaggt cgagcgcctc aagcaggtgc tcctgcacca gcaggccaag 1860 

ttcggccgga acggcagcga ctgcccggac aagttctgcc tgttccagtc ggagaccaag 1920 

aacctcctct tcaacgacaa caccgagtgc ctggcgcgcc tccacggcaa gaccacctac 1980 

gagaagtacc tcggcccgca gtacgtcgcc ggcatcacca acctcaagaa gtgctccacc 2040 

tcccccctcc tggaggcgtg cgagttcctc cgcaagtga 2079 



<210> 4 

<211> 690 

<212> PRT 

<213> Homo sapiens 



<400> 4 



Gly 


Arg 


Arg 


Arg 


Arg 


Ser 


Val 


Gin 


Trp 


Cys 


Ala 


Val 


Ser 


Gin 


Pro 


Glu 


1 








5 










10 










15 




Ala 


Thr 


Lys 


Cys 


Phe 


Gin 


Trp 


Gin 




Asn 


Met 


Arg 


Lys 


Val 


Arg 


Gly 








20 










25 










30 






Pro 


Pro 


Val 


Ser 


Cys 


lie 


Lys 


Arg 


Asp 


Ser 


Pro 


He 


Gin 


Cys 


He 


Gin 






35 










40 










45 








Ala 


He 


Ala 


Glu 


Asn 


Arg 


Ala 


Asp 


Ala 


Val 


Thr 


Leu 


Asp 


Gly 


Gly 


Phe 




50 










55 










60 










lie 


Tyr 


Glu 


Ala 


Gly 


Leu 


Ala 


Pro 


Tyr 


Lys 


Leu 


Arg 


Pro 


Val 


Ala 


Ala 


65 










70 










75 










80 


Glu 


Val 


Tyr 


Gly 


Thr 


Glu 


Arg 


Gin 


Pro 


Arg 


Thr 


His 


Tyr 


Tyr 


Ala 


Val 










85 










90 










95 




Ala 


Val 


Val 


Lys 


Lys 


Gly 


Gly 


Ser 


Phe 


Gin 


Leu 


Asn 


Glu 


Leu 


Gin 


Gly 








100 










105 










110 






Leu 


Lys 


Ser 


Cys 


His 


Thr 


Gly 


Leu 


Arg 


Arg 


Thr 


Ala 


Gly 


Trp 


Asn 


Val 






115 










120 










125 








Pro 


lie 


Gly 


Thr 


Leu 


Arg 


Pro 


Phe 


Leu 


Asn 


Trp 


Thr 


Gly 


Pro 


Pro 


Glu 



130 135 140 
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Pro 


He 


Glu 


Ala 


Ala 


Val 


Ala 


Arg 


Phe 


Phe 


Ser 


Ala 


Ser 


Cys 


Val 


Pro 


145 










150 










ICC 

loo 










1 tZ A 

160 


Gly Ala 


Asp 


Lys 


Gly 


Gin 


Phe 


Pro 


Asn 


Leu. 


Cys 


Arg 


Leu 


Cys 


Ala 


Gly 










165 










170 










17 5 




Thr 


Gly 


Glu 


Asn 


Lys 


Cys 


Ala 


Phe 


Ser 


Ser 


Gin 


Glu 


Pro 


Tyr 


Phe 


Ser 






180 










185 










190 






Tyr 


Ser 


Gly 


Ala 


Phe 


Lys 


Cys 


Leu 


Arg 


Asp 


Gly 


Ala 


Gly 


Asp 


Val 


Ala 






195 










200 










2 05 








Phe 


He 


Arg 


Glu 


Ser 


Thr 


Val 


Phe 


Glu 


Asp 


Leu 


Ser 


Asp 


Glu 


Ala 


Glu 




210 








215 










220 










Arg 


Asp 


Glu 


Tyr 


Glu 


Leu 


Leu 


C Y S 


Pro 


Asp 


Asn 


Thr 


Arg 


Lys 


Pro 


Val 


225 










230 










n n c 

2 35 










O A A 
2 H U 


Asp 


Lys 


Phe 


Lys 


Asp 


c y s 


His 


Leu 


Ala 


Arg 


Val 


Pro 


Ser 


His 


Ala 


Val 










245 










250 










o c c 

25 5 




Val 


Ala 


Arg 


Ser 


Val 


Asn 


Gly 


Lys 


Glu 


Asp 


Ala 


He 


Trp 


Asn 


Leu 


Leu 








260 








2 65 










270 






Arg 


Gin 


Ala 


Gin 


Glu 


Lys 


Phe 


Gly 


Lys 


Asp 


Lys 


Ser 


Pro 


Lys 


Phe 


Gin 






275 










280 










285 








Leu 


Phe 


Gly 


Ser 


Pro 


Ser 


Gly 


Gin 


Lys 


Asp 


Leu 


Leu 


Phe 


Lys 


Asp 


Ser 




290 










295 










300 










Ala 


He 


Gly 


Phe 


Ser 


Arg 


Val 


Pro 


Pro 


Arg 


lie 


Asp 


Ser 


Gly 


Leu 


Gly 


305 










310 










315 










O 1 A 

320 


Ser 


Gly 


Tyr 


Phe 


Thr 


Ala 


He 


Gin 


Asn 


Leu 


Arg 


Lys 


Ser 


Glu 


Glu 


Glu 






325 










330 










335 




Val 


Ala 


Ala 


Arg 


Arg 


Ala 


Arg 


Val 


Val 


Trp 


Cys 


Ala 


Val 


Gly 


Glu 


Gin 








340 










345 










o c r\ 

350 






Glu 


Leu 


Arg 


Lys 


Cys 


Asn 


Gin 


Trp 


Ser 


Gly 


Leu 


Ser 


Glu 


Gly 


Ser 


Val 






355 










360 










3 65 








Thr 


Cys 


Ser 


Ser 


Ala 


Ser 


Thr 


Thr 


Glu 


Asp 


Cys 


He 


Ala 


Leu 


Val 


Leu 




370 










375 










38 0 










Lys 


Gly 


Glu 


Ala 


Asp 


Ala 


Met 


Ser 


Leu 


Asp 


G1 y 


Gly 


Tyr 


Val 


Tyr 


Thr 


385 








390 










3 95 










4 00 


Ala 


Gly 


Lys 


Cys 


Gly 


Leu 


Val 


Pro 


Val 


Leu 


Ala 


Glu 


Asn 


Tyr 


Lys 


Ser 








4 05 










410 










A *1 C 

41 j 




Gin 


Gin 


Ser 


Ser 


Asp 


Pro 


Asp 


Pro 


Asn 


Cys 


Val 


Asp 


Arg 


Pro 


Val 


Glu 








420 










425 










A A 

4 oU 






Gly 


Tyr 


Leu 


Ala 


Val 


Ala 


Val 


Val 


Arg 


Arg 


Ser 


Asp 


Thr 


Ser 


Leu 


Thr 


435 










44 0 










A A C 

4 4 5 








Trp 


Asn 


Ser 


Val 


Lys 


Gly 


Lys 


Lys 


Ser 


Cys 


His 


Thr 


Ala 


Val 


Asp 


Arg 




4 50 










455 










a a a 
4 OU 










Thr 


Ala 


Gly 


Trp 


Asn 


He 


Pro 


Met 


Gly 


Leu 


Leu 


Phe 


Asn 


Gin 


Thr 


Gly 


465 










470 










A 1 C 

4 75 










A O A 

4 80 


Ser 


Gys 


Lys 


Phe 


Asp 


Glu 


Tyr 


Phe 


Ser 


Gin 


Ser 


Cys 


Ala 


Pro 


Gly 


Ser 






4 85 










A fl A 

4 90 










h y f o 




Asp 


Pro 


Arg 


Ser 


Asn 


Leu 


Cys 


Ala 


Leu 


Cys 


lie 


Gly 


Asp 


Glu 


Gin 


Gly 






500 










505 










510 






Glu 


Asn 


Lys 


Cys 


Val 


Pro 


Asn 


Ser 


Asn 


Glu 


Arg 


Tyr 


Tyr 


Gly 


Tyr 


Thr 






515 










520 










c o c 

52 5 








Gly Ala 


Phe 


Arg 


Cys 


Leu 


Ala 


Glu 


Asn 


Ala 


Gly 


Asp 


Val* 


Ala 


Phe 


Val 




530 










535 










C A A 

54 0 










Lys 


Asp 


Val 


Thr 


Val 


Leu 


Gin 


Asn 


Thr 


Asp 


Gly 


Asn 


Asn 


Asn 


Glu 


Ala 


545 








550 










555 










560 


Trp 


Ala 


Lys 


Asp 


Leu 


Lys 


Leu 


Ala 


Asp 


Phe 


Ala 


Leu 


Leu 


Cys 


Leu 


Asp 








565 










570 










57 5 




Gly 


Lys 


Arg 


Lys 


Pro 


Val 


Thr 


Glu 


Ala 


Arg 


Ser 


Cys 


His 


Leu 


Ala 


Met 




580 










585 
















Ala 


Pro 


Asn 


His 


Ala 


Val 


Val 


Ser 


Arg 


Met 


Asp 


Lys 


Val 


Glu 


Arg 


Leu 






595 










600 










60 5 








Lys 


Gin 


Val 


Leu 


Leu 


His 


Gin 


Gin 


Ala 


Lys 


Phe 


Gly 


Arg 


Asn 


Gly 


Ser 


610 










615 










62 0 










Asp 


Cys 


Pro 


Asp 


Lys 


Phe 


Cys 


Leu 


Phe 


Gin 


Ser 


Glu 


Thr 


Lys 


Asn 


Leu 


625 








630 










635 










640 


Leu 


Phe 


Asn 


Asp 


Asn 


Thr 


Glu 


Cys 


Leu 


Ala 


Arg 


Leu 


His 


Gly 


Lys 


Thr 








645 










650 










655 




Thr 


Tyr 


Glu 


Lys 


Tyr 


Leu 


Gly 


Pro 


Gin 


Tyr 


Val 
3 


Ala 


Gly 


He 


Thr 


Asn 
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660 665 670 

Leu Lys Lys Cys Ser Thr Ser Pro Leu Leu Glu Ala Cys Glu Phe Leu 
675 680 685 

Arg Lys 
690 

<210> .5 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 5 

atcgaagctt catgagtaat gtgtgagcat tatgggacca eg 42 

<210> 6 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 6 

ctagtctaga ctcgagccat ggggcegget agggagecat cgcacaagag gaa 53 

<210> 7 
<211> 72 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> codon optimized lactof erricin coding sequence based on Homo sapiens 
sequence 

<400> 7 

accaagtget tccagtggca gcgcaacatg ' cggaaggtgc gcggcccgcc ggtcagctgc 60 
ateaageggg ac 72 

<210> 8 
<211> 162 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> codon optimized EGF coding sequence based on Homo sapiens sequence 
<400> 8 

aactccgact cggagtgccc' cctctcccac gaeggttact gcctccacga eggggtctge 60 
atgtacatcg aggccc.tcga caagtacgcc tgcaactgcg tcgtgggcta categgegag 120 
cggtgccagt accgcgacct caagtggtgg gagctgeget ga 162 

<210> 9 
<211> 213 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> codon optimized IGF-1 coding sequence based on Homo sapiens 
sequence 

<400> 9 

ggeceggaga ccctctgcgg cgccgagctc gtggacgccc tccagttcgt gtgeggegae 60 
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cgcggcttct acttcaacaa gccgaccggc tacggcagca gcagccgccg cgccccgcag 120 
accggca'tcg tggacgagtg ctgcttccgc agctgcgacc tccgccgcct ggagatgtac 180 
tgcgccccgc tcaagcccgc caagagcgcc tga 213 

<210> 10 
<211> 1095 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> codon optimized lactohedrin coding sequence based on Homo sapiens 
sequence 



<400> 10 

ctggacatct gctcgaagaa cccgtgccac aacggcgggc tctgcgagga gatcagccag 60 

gaggtgcggg gcgacgtgtt cccctcgtac acctgcacct gcctgaaggg ctacgccggg 120 

aaccactgcg agacgaagtg cgtggagccc ctggggatgg agaacggeaa catcgccaac 180 

tcccagatcg ccgcctcctc cgtgcgggtg accttcctcg gcctccagca ctgggtcccg 240 

gagctggccc ggctcaaccg ggcgggcatg gtgaacgcgt ggaccccctc gtccaacgac 300 

gacaacccgt ggatccaagt gaacctgctc cgccgcatgt gggtcaccgg cgtggtcacc 360 

caaggcgcca gccgcctggc cagccacgag tacctcaagg ccttcaaggt cgcctacagc 420 

ctcaacggcc acgagttcga cttcatccac gacgtcaaca agaagcacaa ggagttcgtg 480 

ggcaactgga acaagaacgc ggtccacgtg aacctcttcg agacccccgt cgaggcccag 540 

tacgtccgcc tctaccccac gagctgccac accgcctgca cgctccgctt cgagctgctg 600 

gggtgcgagc tgaacgggtg cgcgaacccg ctggggctca agaacaacag catccccgac .660 

aagcagatca cggcctcgtc gtcgtacaag acctggggcc tgcacctctt ctcgtggaac 720 

ccgagctacg cccggctgga caagcagggc aacttcaacg cctgggtcgc cgggagctac 780 

gggaacgacc agtggctcca ggtggacctc ggcagctcca aggaggtcac cggcatcatc 840 

acgcaggggg cccgcaactt cggctccgtg cagttcgtgg cctcctacaa ggtggcctac 900 

tcgaacgaca gcgccaactg gaccgagtac caggacccgc gcaccgggtc cagcaagatc 960 

ttccccggca actgggacaa ccacagccac aagaagaacc tgttcgagac cccca.tc.ctc 1020 

gcccggtacg tccgcatcct ccccgtcgct tggcacaacc ggatcgcgct ccggctggag 1080 

ctcctcggct gctga 1095 



<210> 11' 
<211> 489 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> codon optimized kappa-casein coding sequence based on Homo sapiens 
sequence 



<400> 11 

gaggtccaaa accagaagca gcccgcctgc cacgagaacg acgagcgccc cttctaccag 60 

aagaccgcac cctacgtccc gatgtactac gtcccgaaca gctaccccta ctacggtacg 120 

aacctgtacc agcgccgccc ggccatcgct atcaacaacc cctacgtccc ccggacctac 180 

tacgcgaacc cggccgtggt gcggccccac gcgcagatcc cgcagcggca gtacctgcca 240 

aacagccacc cccccaccgt ggtgcggcgg cccaacctcc acccgagctt catcgctatc 300 

ccccccaaga agatccagga caagatcatc atcccgacca tcaacaccat cgccaccgtg 360 

gagccgacgc cagcccccgc gaccgagccc acggtggaca gcgtcgtgac cccagaggcg 420 

ttctccgaat cgatcatcac ctccaccccc gagaccacca cggtggccgt cacgccgccg 480 

acggcatga 489 



<210> 12 
<211> 1233 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> codon optimized haptocorrin coding sequence based on Homo sapiens 
sequence . 

<400> 12 

gagatctgcg aggtctccga ggagaactac atccgcctca agcccctcct gaacaccatg 60 

5 
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atccagagca actacaaccg gggcacgtcg gccgtgaacg tcgtgctctc cctgaagctc 120 

gtgggcatcc agatccagac cctcatgcag aagatgatcc agcagatcaa gtacaacgtg 180 

aagagccgcc tctcggacgt gtccagcggc gagctggcgc tcatcatcct cgcgctcggc 240 

gtgtgccgga acgcggagga gaacctcatc tacgactacc acctcacgga caagctggag 300 

aacaagttcc aggccgagat cgagaacatg gaggcccaca acggcacccc gctgaccaac 360 

tactaccagc tcagcctgga cgtcctcgcg ctctgcctgt tcaacgggaa ctactccacc 420 

gccgaggtgg tcaaccactt cacccccgag aacaagaact actacttcgg ctcgcagttc 480 

tccgtggaca ccggggccat ggccgtcctg gccctcacct gcgtgaagaa gtccctcatc 540 

aacggccaga tcaaggccga cgagggctcc ctgaagaaca tctcgatcta cacc.aagagc 600 

ctcgtggaga agatcctcag cgagaagaag gagaacgggc tgatcggcaa caccttctcg 660 

accggcgagg cgatgcaggc cctgttcgtg ageagcgact actacaacga gaacgactgg 720 

aactgccagc agaccctcaa cacggtcctg accgagatca gccagggcgc gttcagcaac 780 

cccaacgccg ccgcccaggt cctgccggcc ctgatgggca agaccttcct cgacatcaac 840 

aaggacagct cctgcgtgtc cgcgagcggc aacttcaaca tctccgccga cgagccgatc 900 

acggtgacgc cgcccgacag ccagtcgtac atctccgtga actacagcgt gcggatcaac 960 

gagacctact tcacgaacgt gacggtcctc aacggctcgg tcttcctgag cgtgatggag 1020 

aaggcgcaga agatgaacga cacgatcttc ggcttcacga tggaggagcg cagctggggc 1080 

ccctacatca cctgcatcca gggcctctgc gccaacaaca acgaccgcac ctactgggag 1140 

ctgctgagcg gcggcgagcc gctgagccag ggggccggca gctacgtggt ccgcaacggc 1200 

gagaacctgg aggtccggtg gagcaagtac tga 1233 



<210> 13 
<211> 2061 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> codon optimized lactoperoxidase coding sequence based on Homo 
sapiens sequence 



<400> 13 

caaacgaccc ggacgtcggc gatctccgac acggtctcgc aggccaaggt gcaagtcaac 60 

aaggcattcc tggattcgcg cacgcggctg aagaccgcga tgtcgtccga gaccccgacg 120 

agccggcagc tgagcgagta cctcaagcac gcgaaggggc ggacgcgcac cgccatccgc 180 

aatggccaag tgtgggagga atccctgaag cggctgcggc agaaggcgtc gctcaccaac 240 

gtgaccgacc cgtccctcga cctgaccagc ctctccctgg aggtcggctg cggcgccccg 300 

gcgcccgtcg tgcgctgcga cccctgctcg ccataccgca cgatcacggg cgactgcaac 360 

aaccggcgga agccggcact gggggctgcg aaccgcgccc tcgcgcgctg gctccccgcc 420 

gagtacgagg acggcctcag cctccccttc ggttggaccc ccggcaagac gcgcaacggc 480 

ttcccgctcc cgctcgctcg cgaggtcagc aacaagatcg tcggttacct gaacgaggag 540 

ggggtcctcg accaaaaccg ctccctcctc ttcatgcagt gggggcagat cgtggaccac 600 

gacctggact tcgccccgga cacggagctg ggctccagcg agtacagcaa gacccagtgc 660 

gacgaatact gcatccaggg cgacaactgc ttcccgatca tgttcccccc gaacgacccg 720 

aaggcgggca cccagggcaa gtgcatgccg ttcttccggg caggcttcgt ctgcccgacc 780 

cccccgtaca agtccctcgc gcgcgagcag atcaacgcgc tcacgtcctt cctcgacgcc 840 

agcttcgtct acagcagcga gccgtccctc gccagccgcc tccgcaacct cagcagcccc 900 

ctcggcctca tggcggtcaa ccaggaggtg tcggaccacg gcctcccata cctgccgtac 960 

gacagcaaga agccgtcccc ctgcgagttc atcaacacca ccgcgcgcgt cccgtgcttc' 1020 

ctcgccggcg attcgcgggc gagcgagcac atcctcctcg ccacgagcca caccctgttc 1080 

ctccgcgagc acaaccgcct cgcccgggag ctgaagcgcc tcaacccgca gtgggacggc 1140 

gagaagctct accaggaggc ccggaagatc ctcggcgctt tcgtccagat catcaccttc 1200 

cgggactacc tccccatcct gctcggtgac cacatgcaga agtggatccc cccctaccaa 1260 

ggctactccg agagcgtgga cccgcgcatc tccaacgtct tcacgttcgc gttccgcttc 1320 

gggcacctgg aggtgccgtc gtcgatgttc cgcctcgacg agaactacca gccctggggc 1380 

ccagagccgg agctgccgct ccacaccctg ttcttcaaca cctggcggat ggtcaaggac 1440 

ggcggcatcg acccgctcgt gcgcgggctc ctggctaaga agtcgaagct catgaagcag 1500 

aacaagatga tgaccggcga gctgcgcaac aagctgttcc agcccaccca ccgcatccac 1560 

gggttcgacc tggctgcaat caacacccag cggtgccgcg accacggcca gcccggctac 1620 

aactcgtggc gcgcgttctg cgacctctcc cagccacaga cgctggagga gctcaacacc 1680 

gtgctcaaga gcaagatgct cgccaagaag ctgctcgggc tctacggcac gcccgacaac 1740 

atcgacatct ggatcggggc catcgcggag ccgctcgtgg agcgcgggcg cgtcggcccg 1800 

ctgctcgcgt gcctcctggg caagcaattc caacagatcc gcgacgggga ccggttctgg 1860 

tgggagaacc ccggcgtgtt caccaacgag cagaaggatt cgctccaaaa gatgagcttc 1920 

tcccgcctgg tgtgcgacaa cacccgcatc accaaggtcc cgcgcgaccc attctgggcc 1980 

aactcctacc cgtacgactt cgtggactgc tccgccatcg acaagctcga cctgtccccc 2040 



6 
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tgggcatcgg tgaagaactg a 2061 
14 

1185 
DNA 

Artificial Sequence 
<220> 

<223> codon optimized alpha-l-antitrypsin coding sequence based on Homo 
sapiens sequence 



<400> 14 

gaggacccgc agggcgacgc cgcccagaag accgacacca gccaccacga ccaggaccac 60 

ccgacgttca acaagatcac cccgaatttg gccgaattcg ccttcagcct gtaccgccag 120 

ctcgcgcacc agtccaactc caccaacatc ttcttcagcc cggtgagcat cgccaccgcc 180 

ttcgccatgc tgtccctggg taccaaggcg gacacccacg acgagatcct cgaagggctg 240 

aacttcaacc tgacggagat cccggaggcg cagatccacg agggcttcca ggagctgctc 300 

aggacgctca accagccgga ctcccagctc cagctcacca ccggcaacgg gctcttcctg 360 

tccgagggcc tcaagctcgt cgataagttc ctggaggacg tgaagaagct ctaccactcc 420 

gaggcgttca ccgtcaactt cggggacacc gaggaggcca agaagcagat caacgactac 480 

gtcgagaagg ggacccaggg caagatcgtg gacctggtca aggaattgga cagggacacc 540 

gtcttcgcgc tcgtcaacta catcttcttc aagggcaagt gggagcgccc gttcgaggtg 600 

aaggacaccg aggaggagga cttccacgtc gaccaggtca ccaccgtcaa ggtcccgatg 660 

atgaagaggc tcggcatgtt caacatccag cactgcaaga agctctccag ctgggtgctc 720 

ctcatgaagt acctggggaa cgccaccgcc atcttcttcc tgccggacga gggcaagctc 780 

cagcacctgg agaacgagct gacgcacgac atcatcacga agttcctgga gaacgaggac 840 

aggcgctccg ctagcctcca cctcccgaag ctgagcatca ccggcacgta cgacctgaag 900 

agcgtgctgg gccagctggg catcacgaag gtcttcagca acggcgcgga cctctccggc 960 

gtgacggagg aggcccccct gaagctctcc aaggccgtgc acaaggcggt gctcacgatc 1020 

gacgagaagg ggacggaagc tgccggggcc atgttcctgg aggccatccc cgtgtccatc 1080 

ccgcccgagg tcaagttcaa caagcccttc gtcttcctga tgatcgagca gaacacgaag 1140 

agccccctct tcatggggaa ggtcgtcaac cccacgcaga agtga 1185 



<210> 15 
<211> 786 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Rice Gtl promoter and Gtl leader coding sequence 



<4 00> 15 

catgagtaat gtgtgagcat tatgggacca cgaaataaaa agaacatttt gatgagtcgt 60 

gtatcctcga tgagcctcaa aagttctctc accccggata agaaaccctt aagcaatgtg 120 

caaagtttgc attctccact gacataatgc aaaataagat atcatcgatg acatagcaac 180 

tcatgcatca tatcatgcct ctctcaacct attcattcct actcatctac ataagtatct 240 

tcagctaaat gttagaacat aaacccataa gtcacgtttg atgagtatta ggcgtgacac 300 

atgacaaatc acagactcaa gcaagataaa gcaaaatgat gtgtacataa aactccagag 360 

ctatatgtca tattgcaaaa agaggagagc ttataagaca aggcatgact cacaaaaatt 420 

cacttgcctt tcgtgtcaaa aagaggaggg ctttacatta tccatgtcat attgcaaaag 4 80 

aaagagagaa agaacaacac aatgctgcgt caattataca tatctgtatg tccatcatta 540 

ttcatccacc tttcgtgtac cacacttcat atatcataag agtcacttca cgtctggaca 600 

ttaacaaact ctatcttaac atttagatgc aagagccttt atctcactat aaatgcacga 660 

tgatttctca ttgtttctca caaaaagcgg ccgcttcatt agtcctacaa caacatggca 720 

tccataaatc gccccatagt tttcttcaca gtttgcttgt tcctcttgtg cgatggctcc 780 

ctagcc 786 



<210> 16 

<211> 1055 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Rice Gib promoter and Gtl leader coding sequence 
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<400> 16 

ctgcagggag gagaggggag agatggtgag agaggaggaa gaagaggagg ggtgacaatg 60 

atatgtgggg catgtgggca cccaattttt taattcattc ttttgttgaa actgacatgt 120 

gggtcccatg agatttatta tttttcggat cgaatcgcca cgtaagcgct acgtcaatgc 180 

tacgtcagat gaagaccgag tcaaattagc cacgtaagcg ccacgtcagc caaaaccacc 240 

atccaaaccg ccgagggacc tcatctgcac tggttttgat agttgaggga cccgttgtat 300 

ctggtttttc gattgaagga cgaaaatcaa atttgttgac aagttaaggg accttaaatg 360 

aacttattcc atttcaaaat attctgtgag ccatatatac cgtgggcttc caatcctcct 420 

caaattaaag ggccttttta aaatagataa ttgccttctt tcagtcaccc ataaaagtac 480 

aaaactacta ccaacaagca acatgcgcag ttacacacat tttctgcaca tttccgccac 540 

gtcacaaaga gctaagagtt atccctagga caatctcatt agtgtagata catccattaa 600 

tcttttatca gaggcaaacg taaagccgct ctttatgaca aaaataggtg acacaaaagt 660 

gttatctgcc acatacataa cttcagaaat tacccaacac caagagaaaa ataaaaaaaa 720 

atctttttgc aagctccaaa tcttggaaac ctttttcact ctttgcagca ttgtactctt 780 

gctctttttc caaccgatcc atgtcaccct caagcttcta cttgatctac acgaagctca 840 

ccgtgcacac aaccatggcc acaaaaaccc tataaaaccc catccgatcg ccatcatctc 900 

atcatcagtt cattaccaac aaacaaaaga ggaaaaaaaa catatacact tctagtgatt 960 

gtctgattga tcatcaatct agaggcggcc gcatggctag caaggtcgtc ttcttcgcgg 1020 

cggcgctcat ggcggccatg gtggccatct ccggc 1055 



<210> 17 
<211> 976 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bx7 promoter 



<400> 17 

ctgcaggcca gggaaagaca atggacatgc aaagaggtag gggcagggaa gaaacacttg 60 

gagatcatag aagaacataa gaggttaaac ataggagggc ataatggaca attaaatcta 120 

cattaattga actcatttgg gaagtaaaca aaatccatat tctggtgtaa atcaaactat 180 

ttgacgcgga tttactaaga tcctatgtta attttagaca tgactggcca aaggtttcag 240 

ttagttcatt tgtcacggaa aggtgttttc ataagtccaa aactctacca acttttttgc 300 

acgtcatagc atagatagat gttgtgagtc attggataga tattgtgagt cagcatggat 360 

ttgtgttgcc tggaaatcca actaaatgac aagcaacaaa acctgaaatg ggctttagga 420 

gagatggttt atcaatttac atgttccatg caggctacct tccactactc gacatggtta 480 

gaagttttga gtgccgcata tttgcggaag caatggcact actcgacatg gttagaagtt 540 

ttgagtgccg catatttgcg gaagcaatgg ctaacagata catattctgc caaaccccaa 600 

gaaggataat cactcctctt agataaaaag aacagaccaa tgtacaaaca tccacacttc 660 

tgcaaacaat acaccagaac taggattaag cccattacgt ggctttagca gaccgtccaa 720 

aaatctgttt tgcaagcacc aattgctcct tacttatcca gcttcttttg tgttggcaaa 780 

ctgccctttt ccaaccgatt ttgtttcttc tcacgctttc ttcataggct aaactaacct 840 

cggcgtgcac acaaccatgt cctgaacctt cacctcgtcc ctataaaagc ccatccaacc 900 

ttacaatctc atcatcaccc acaacaccga gcaccccaat ctacagatca attcactgac 960 

agttcactga tctaga 976 



<210> 18 
<211> 1009 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Glub-2 promoter 



<400> 18 

ctgcagtaat ggatacctag tagcaagcta gcttaaacaa atctaaattc caatctgttc 60 

gtaaacgttt tctcgatcgc aattttgatc aaaactattg aaaacctcaa ttaaaccatt 120 

caaaattttt aatataccca acaagagcgt ccaaaccaaa tatgtaaata tggatgtcat 180 

gataattgac ttatgacaat gtgattattt catcaagtct ttaaatcatt aattctagtt 240 

gaaggtttat gttttcttat gctaaagggt tatgtttata taagaatatt aaagagcaaa 300 

ttgcaataga tcaacacaac aaatttgaat gtttccagat gtgtaaaaat atccaaatta 360 

attgttttaa aatagtttta agaaggatct gatatgcaag tttgatagtt agtaaactgc 420 

aaaagggctt attacatgga aaattcctta ttgaatatgt ttcattgact ggtttatttt 480 

acatgacaac aaagttacta gtatgtcaat aaaaaaatac aaggttactt gtcaattgta 540 
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ttgtgccaag taaagatgac aacaaacata caaatttatt tgttctttta tagaaacacc 600 

taacttatca aggatagttg gccacgcaaa aatgacaaca tactttacaa ttgtatcatc 660 

ataaagatct tatcaagtat aagaacttta tggtgacata aaaaataatc acaagggcaa . 720 

gacacatact aaaagtatgg acagaaattt cttaacaaac tccatttgtt ttgtatccaa 780 

aagcataaga aatgagtcat ggctgagtca tgatatgtag ttcaatcttg caaaattgcc 84 0 

tttttgttaa gtattgtttt aacactacaa gtcacatatt gtctatactt gcaacaaaca 900 

ctattaccgt gtatcccaag tggccttttc attgctatat aaactagctt gatcggtctt 960 

tcaactcaca tcaattagct taagtttcca ttagcaactg ctaatagct 1009 



<210> 19 
<211> 839 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Gt3 promoter 



<400> 19 

ctgcagtgta agtgtagctt cttatagctt agtgctttac tatcttcaca agcacatgct 60 

atagtattgt tccaagatga aagaataatt catccttgct accaacttgc atgatattat 120 

atttgtgaat atcctatctc ttggcttata atgaaatgtg ctgctgggtt attctgacca 180 

tggtatttga gagcctttgt atagctgaaa ccaacgtata tcgagcatgg aacagagaac 240 

aaaatgcaag gattttttta ttctggttca tgccctggat gggttaatat cgtgatcatc 300 

aaaaaagata tgcataaaat taaagtaata aatttgctca taagaaacca aaaccaaaag 360 

cacatatgtc ctaaacaaac tgcattttgt ttgtcatgta gcaatacaag agataatata 420 

tgacgtggtt atgacttatt cactttttgt gactccaaaa tgtagtaggt ctaactgatt 480 

gtttaaagtg atgtcttact gtagaagttt catcccaaaa . gcaatcacta aagcaacaca 540 

cacgtatagt ccac.cttcac gtaattcttt gtggaagata acaagaaggc tcactgaaaa 600 

ataaaagcaa agaaaaggat atcaaacaga ccattgtgca tcccattgat ccttgtatgt 660 

ctatttatct atcctccttt tgtgtacctt acttctatct agtgagtcac ttcatatgtg 720 

gacattaaca aactctatct taacatctag tcgatca'cta ctttacttca ctataaaagg 780 

accaacatat atcatccatt tctcacaaaa gcattgagtt cagtcccaca aaatctaga 839 



<210> 20 
<211> 1302 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Glub-1 promoter 



<400> 20 

ctgcagagat atggattttc taagattaat tgattctctg tctaaagaaa aaaagtatta 60 

ttgaattaaa tggaaaaaga aaaaggaaaa aggggatggc ttctgctttt tgggctgaag 120 

gcggcgtgtg gccagcgtgc tgcgtgcgga cagcgagcga acacacgacg gagcagctac 180 

gacgaacggg ggaccgagtg gaccggacga ggatgtggcc taggacgagt gcacaaggct 240 

agtggactcg gtccccgcgc ggtatcccga gtggtccact gtctgcaaac acgattcaca 300 

tagagcgggc agacgcggga gccgtcctag gtgcaccgga agcaaatccg tcgcctgggt 360 

ggatttgagt gacacggccc acgtgtagcc tcacagctct ccgtggtcag atgtgtaaaa 420 

ttatcataat atgtgttttt caaatagtta aataatatat ataggcaagt tatatgggtc 480 

aataagcagt aaaaaggctt atgacatggt aaaattactt acaccaatat gccttactgt 540 

ctgatatatt ttacatgaca acaaagttac aagtacgtca tttaaaaata caagttactt 600 

atcaattgta gtgtatcaag taaatgacaa caaacctaca aatttgctat tttgaaggaa 660 

cacttaaaaa aatcaatagg caagttatat agtcaataaa ctgcaagaag gcttatgaca 720 

tggaaaaatt acatacacca atatgcttta ttgtccggta tattttacaa gacaacaaag 780 

ttataagtat gtcatttaaa aatacaagtt acttatcaat tgtcaagtaa atgaaaacaa 840 

acctacaaat ttgttatttt gaaggaacac ctaaattatc aaatatagct tgctacgcaa 900 

aatgacaaca tgcttacaag ttattatcat cttaaagtta gactcatctt ctcaagcata 960 

agagctttat ggtgcaaaaa caaatataat gacaaggcaa agatacatac atattaagag 1020 

tatggacaga catttcttta acaaactcca tttgtattac tccaaaagca ccagaagttt 1080 

gtcatggctg agtcatgaaa tgtatagttc aatcttgcaa agttgccttt ccttttgtac 1140 

tgtgttttaa cactacaagc catatattgt ctgtacgtgc aacaaactat atcaccatgt 1200 

atcccaagat gcttttttat tgctatataa actagcttgg tctgtctttg aactcacatc 1260 

aattagctta agtttccata agcaagtaca aatagctcta ga 1302 
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<210> 21 
<211> 675 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Rice proalmin promoter 



<400> 21 

ctgcagcatc ggcttaggtg tagcaacacg actttattat tattattatt attattatta 60 

ttattttaca aaaatataaa atagatcagt ccctcaccac aagtagagca agttggtgag 120 

ttattgtaaa gttctacaaa gctaatttaa aagttattgc attaacttat ttcatattac 180 

aaacaagagt gtcaatggaa caatgaaaac catatgacat actataattt tgtttttatt 240 

attgaaatta tataattcaa agagaataaa tccacatagc cgtaaagttc tacatgtggt 300 

gcattaccaa aatatatata gcttacaaaa catgacaagc ttagtttgaa aaattgcaat 360 

ccttatcaca ttgacacata aagtgagtga tgagtcataa tattattttt cttgctaccc 420 

atcatgtata tatgatagcc acaaagttac tttgatgatg atatcaaaga acatttttag 480 

gtgcacctaa cagaatatcc aaataatatg actcacttag atcataatag agcatcaagt 540 

aaaactaaca ctctaaagca accgatggga aagcatctat aaatagacaa gcacaatgaa 600 

aatcctcatc atccttcacc acaattcaaa tattatagtt gaagcatagt agtagaatcc 660 

aacaacaatc tagag 675 



<210> 22 
<211> 1098 
<212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> Rice cysteine peptidase promoter 



<400> 22 

ccaggcttca tcctaaccat tacaggcaag atgttgtatg aagaagggcg aacatgcaga 60 

ttgttaaact gacacgtgat ggacaagaat gaccgattgg tgaccggtct gacaatggtc 120 

atgtcgtcag cagacagcca tctcccacgt cgcgcctgct tccggtgaaa gtggaggtag 180 

gtatgggccg tcccgtcaga aggtgattcg gatggcagcg atacaaatct ccgtccatta 240 

atgaagagaa gtcaagttga aagaaaggga gggagagatg gtgcatgtgg gatccccttg 300 

ggatataaaa ggaggacctt gcccacttag aaaggagagg agaaagcaat cccagaagaa 360 

tcgggggctg actggcactt tgtagcttct tcatacgcga atccaccaaa acacaggagt 420 

agggtattac gcttctcagc ggcccgaacc tgtatacatc gcccgtgtct tgtgtgtttc 480 

cgctcttgcg aaccttccac agattgggag cttagaacct cacccagggc ccccggccga 540 

actggcaaag gggggcctgc gcggtctccc ggtgaggagc cccacgctcc gtcagttcta 600 

aattacccga tgagaaaggg aggggggggg gggaaatctg ccttgtttat ttacgatcca 660 

acggatttgg tcgacaccga tgaggtgtct taccagttac cacgagctag attatagtac 720 

taattacttg aggattcggt tcctaatttt ttacccgatc gacttcgcca tggaaaattt 780 

tttattcggg ggagaatatc caccctgttt cgctcctaat taagatagga attgttacga 840 

ttagcaacct aattcagatc ag.aattgtta gttagcggcg ttggatccct cacctcatcc 900 

catcccaatt cccaaaccca aactcctctt ccagtcgccg acccaaacac ' gcatccgccg 960 

cctataaatc ccacccgcat cgagcctatc aagcccaaaa aaccacaaac caaacgaaga 1020 

aggaaaaaaa aaggaggaaa agaaaagagg aggaaagcga agaggttgga gagagacgct 1080 

cgtctccacg tcgccgcc 1098 



<210> 23. 
<211> 432 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Barley D-Hordian promoter 



<400> 23 

cttcgagtgc ccgccgattt gccagcaatg gctaacagac acatattctg ccaaaacccc 60 

agaacaataa tcacttctcg tagatgaaga gaacagacca agatacaaac gtccacgctt 120 

cagcaaacag taccccagaa ctaggattaa gccgattacg cggctttagc agaccgtcca 180 

aaaaaactgt tttgcaaagc tccaattcct ccttgcttat ccaatttctt ttgtgttggc 240 

aaactgcact tgtccaaccg attttgttct tcccgtgttt cttcttaggc taactaacac 300 
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agccgtgcac atagccatgg tccggaatct tcacctcgtc cctataaaag cccagccaat 360 
ctccacaatc tcatcatcac cgagaacacc gagaaccaca aaactagaga tcaattcatt 420 
gacagtccac eg 432 

<210> 24 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> bx7 signal peptide sequence 
<400> 24 

atggctaagc gcctggtcct etttgeggea gtagtegteg ccctcgtggc tctcaccgcc 60 

<210> 25 
<211> 72 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Glub~2 signal peptide sequence 
<400> 25 

atggcaacta ccattttctc tegtttttet atatactttt gtgetatget attatgecag 60 
ggttctatgg cc 72 

<210> 26 
<211> 85 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Gt3 signal peptide sequence 
<400> 26 

atgtggacat taacaaactc tatcttaaca tctagtcgat cactacttta cttcactata 60 
aaaggaccaa catatatcat ccatt 85 

<210> 27 
<211> 72 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Glub~2 signal peptide sequence 
<400> 27 

atggcgagtt ccgttttctc teggttttet atatactttt gtgttcttct attatgecat 60 
ggttctatgg cc 72 

<210> 28 
<211> 69 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> proalmin signal peptide sequence 
<400> 28 

atgaagatca ttttegtatt tgctctcctt gctattgttg catgcaacgc ttctgeaegg 60 
tttga tget 69 

<210> 29 
<211> 63 
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<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Rice cysteine peptidase signal peptide sequence 
<400> 29 

atggccgccc gcgccgccgc cgccgcgttc ctgctgctgc tcatcgtcgt tggtcaccgc 60 
gcc 63 

<210> 30 

<211> 63 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> D~Hordian signal peptide sequence 

<400> 30 

atggctaagc ggctggtcct ctttgtggcg gtaatcgtcg ccctcgtggc tctcaccacc 60 
gcc 63 



<210> 31 

<211> 1314 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> 02 transcription factor sequence 



<400> 31 

atggagcacg tcatctcaat ggaggagatc ctcgggccct tctgggagct gctaccaccg 60 

ccagcgccag agccagagcg agagcagcct ccggtaaccg gcatcgtcgt cggcagtgtc 120 

atagacgttg ctgctgctgg tcatggtgac ggggacatga tggatcagca gcacgccaca 180 

gagtggacct ttgagaggtt actagaagag gaggctctga cgacaagcac accgccgccg 240 

gtggtggtgg tgccgaactc ttgttgctca ggcgccctaa atgctgaccg gccgccggtg 300 

atggaagagg cggtaactat ggcgcctgcg gcggtgagta gtgccgtagt aggtgacccc 360 

atggagtaca atgccatact gaggaggaag ctggaggagg acctcgaggc cttcaaaatg 420 

tggagggcgg cctccagtgt tgtgacctca gatcaacgtt ctcaaggctc aaacaatcac 480 

actggaggta gcagcatcag gaataatcca gtgcagaaca agctgatgaa cggcgaagat S40 

ccaatcaaca ataaccacgc tcaaactgca ggccttggcg tgaggcttgc tactagctct 600 

tcctcgagag atccttcacc atcagacgaa gacatggacg gagaagtaga gattctgggg 660 

ttcaagatgc ctaccgagga aagagtgagg aaaagaaagg aatccaatag agaatcagcc 720 

agacgctcga gatacaggaa agccgctcac ctgaaagaac tggaagacca ggtagcacag 780 

ctaaaagccg agaattcttg cctgctgagg cgcattgccg ctctgaacca gaagtacaac 840 

gacgctaacg tcgacaacag ggtgctgaga gcggacatgg agaccctaag agctaaggtg 900 

aagatgggag aggactctct gaagcgggtg atagagatga gctcatcagt gccgtcgtcc 960 

atgcccatct cggcgccgac ccccagctcc gacgctccag tgccgccgcc gcctatccga 1020 

gacagcatcg tcggctactt ctccgccaca gccgcagacg acgatgcttc ggtcggcaac 1080 

ggtttcttgc gactgcaagc tcatcaagag cctgcatcca tggtcgtcgg tggaactctg 1140 

agcgccacag agatgaaccg agtagcagca gccacgcatt gcgcgggggc catggagcac 1200 

atccagacgg cgatgggatc catgccgccg acctccgcct ccggatctac accgccgccg 1260 

caggattatg agctgctggg tccaaatggg gccatacaca tggacatgta ttag 1314 



<210> 32 
<211> 987 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PBF transcription factor sequence 
<400> 32 

atggacatga tctccggcag cactgcagca acatcaacac cccacaacaa ccaacaggcg 60 
gtgatgttgt catcccccat tataaaggag gaagctaggg acccaaagca gacacgagcc 120 
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atgccccaaa taggtggcag tggggagcgt aagccgaggc cgcaactacc tgaggcgctc 180 

aagtgcccac gctgcgactc caacaacacc aagttttgct actacaacaa ttatagcatg 240 

tcacaaccac gctacttttg caaggcttgc cgccgctatt ggacacatgg tggtaccctc 300 

cgcaatgtcc ccattggtgg tgggtgtcgc aagaacaaac atgcctctag atttgtcttg 360 

ggctctcaca cctcatcgtc ctcatctgct acctatgcac cattatcccc tagcaccaac 420 

gctagctcta gcaatatgag catcaacaaa catatgatga tggtgcctaa catgaegatg 480 

cctaccccaa cgacaatggg cttattccct aatgtgctcc caacacttat gccgacaggt 540 

ggaggcgggg gctttgactt cactatggac aaccaacata gatcattgtc cttcacacca 600 

atgtctctac ctagccaggg gccagtgcct atgctggctg caggagggag tgaggcaaca 660 

ccgtctttcc tagagatgct gagaggaggg atttttcatg gtagtagtag ctataacaca 720 

agtctcacga tgagtggtgg caacaatgga atggacaagc cattttcgct gccatcatat 780 

ggtgcaatgt gcacaaatgg gttgagtggc tcaaccacta atgatgccag acaactggtg 840 

gggcctcagc aggataacaa ggccatcatg aagagcagta ataacaacaa tggtgtatca 900 

ttgttgaacc tctactggaa caagcacaac aacaacaaca acaacaacaa caacaacaac 960 

aacaacaaca acaacaaggg acaataa 987 



<210> 33 
<211> 3902 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reb transcription factor sequence 



<400> 33 

atggagcggg tgttctccgt ggaggagatc tccgacccat tctgggtccc gcctccgccg 60 

ccgcagtcgg cggcggcggc ccagcagcag ggcggcggcg gcgtggcttc gggaggtggt 120 

ggtggtgtag cggggggcgg cggcggcggg aacgcgatga accggtgccc gtcggagtgg 180 

tacttccaga a'gtttctgga ggaggcggtg ctcgatagcc ccgtcccgaa ccctagcccg 240 

agggccgaag cgggagggat caggggcgca ggaggggtgg tgccggtcga tgttaagcag 300 

ccgcagctct cggcggcggc gacgacgagc gcggtggtgg accccgtgga gtacaacgcg 360 

atgctgaagc agaagctgga gaaggacctc gccgcggtcg ccatgtggag ggtacagcca 420 

ttctcccccc ctctag-tact cgagagctta ctgagatcgg caatgctagc tactgtttgc 480 

atcgaatgtt tataggtatt tagatcgggc atttctatag accaatggcg tccatggtct 540 

tgcaatgcgc tctgttgagt gtcggtggtt ggttcgactc atagtatgta gggttgtgcg 600 

tatgtacaaa cggaagcttc atagacctcg gtattgagat tgcgatatcg atgcaacctg 660 

cgaattggcg atgtaatcag tcatattctt actaaactgc gagacagtgg tttgtttgca 720 

attgcaatat ttttgtatgg ggctgcttaa actgtcattg cctttttaga ttggcaatat 780 

gtgactttat gcaagtattt gattgggcgg atccaggaac aaaaagttgg ggggattcaa 840 

cataccgagt acactggcat aaacacatca tctcagtatt aaactatgct aaaatgctat . 900 

taagagacct ttagcacctc ttatcttatc aaccatggtg aaaaaattga aggggggact 960 

caggggggta . tccatgggtc cgatgggtgc aggggggact gagtcccccc tgcacccacg 1020 

ttgaatccgc cctggcatgc gtataagctg tcacagccat ttctaggtgc ttgtgcttag 1080 

ttgggtgatg tcagcttaat ttgtcttttc tatgtcgtca tcgattttct aagaaacgaa . 1140 

aaatagccta tttatgtgct ccagaatttg atgatccctg gcccttcatt tgctgaaatt 1200 

agcctatttg ttggttgccc ttcagttttt tcccagctta tgttgttgca atgtgtggct 1260 

atgcctcgtt ttgtgcccta taatttatta tttgcaattc atttttgtac atgacttaaa 1320 

atgacactag agcaacatgc actgattggt tatcctataa tcatttatgt agttctgttc 1380 

attttatcat gctagctcat gtcattttca tcttcaggcc tctggcacag ttccacctga 1440 

gcgtcctgga gctggttcat ccttgctgaa tgcagatgtt tcacacatag gcgctcctaa 1500 

ttccatcgga ggtacttatc ttatctggtt acattttcag attgttatga aactacccaa 1560 

atatcctgca caattgcatg ggattaaatt tta.gtttctt tgaaatagaa gtagagttgt 1620 

attgctgtca cgtcatcaaa tagttctgaa gctatgaata aataagttcc gcatttgtta 1680 

gtgattcttt gaacattaga attgttatgc ttaagtagat agggttatgt ttgtttggag 1740 

ttcccttaaa tcatttcatt gctgactgcc agctggcagg agcatttgtt gttgccttga 1800 

ccatgaatga agaccttcct gttctgagtg ctcacaagaa aacatatttt gattaatgca 1860 

ccttgaatcc ttaggatctt gcaaagatgg gcacttagct ttagaattga gtagtactta 1920 

aatagctgtt gttatcatga tttgtcctgt agtgaaatgt cgacaaaaca ggaatgctac 1980 

ttttgacttc tgatatttca tgcctggctt tacttatgct ctgtttggaa catgggcaca 2040 

tatcaggcaa tgctactcca gttcaaaaca tgctaagtgg cccaagtggg ggatcgggct 2100 

cacagttggt acagaatgtt gatgtccttg taaagcagcc caccagctct tcatcaaggg 2160 

agcagtcaga tgatgatgac atgaagggag aagctgagac cactggaact gcaagacctg 2220 

ctgatcaaag attacaacga aggtgatcat tcattgcttc cttgtaatat agattctgta 2280 

cataattaac ctacctcgtc atgcatgcat gtgtcctatt ttcaccttag ccctttcagt 2340 

tggatttcca ctttcatccg gtagcctttc agtttcctat tgcatcgcat atatgatctt 2400 
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ttacctacca tattagttct ctgtgtgcca tactcagtgc ttagtgtctc gagcaagaga 2460 

ggaatttgta tggctattac acgtagcact ttgctctcta cttgtttatt gacataagca 2520 

atttgggatg a'attaaatct gagttcacat catattcctt atgtcacaag tttctgaaac 2580 

cgattgtatc tagtatctgg ttgatgcacc cccatcttgg atttgcaaat caaagttata 2640 

ctccctagag agctttacct ttcataaagc aattacccca ataaaccacg gatttgatag 2700 

ctattgacta tgattaccag aattcatttg gcagctattt tctcaattta agtttggtat 2760 

tagtctcagt tggctgtaaa ataatgtcac ggtagggtac atgtatgtgc agcatacaag 2820 

gtatgggtga gttatgatat ggacagtgtg tacaccccac atttgctcac taaaatcaaa 2880 

atattcaaac gtcacgtgat gatatggtgg attgcattat accttgtatt gtttattatg 2940 

ttacttgtgc tagacaataa tataggctgt tcttttgggt gattttgtat gaagatgttg 3000 

agcaagcact tctcgatata atgctagttt tgttgacctg ttccaggaag caatccaatc 3060 

gggagtcagc caggcgctca agaagcagaa aggcagctca cttgaatgag ctggaggcac 3120 

aggtgtgata gttcacatag ttatttt cga taagacataa aatcctaaat tactggctac 3180 

tgacttcagt tatggattta cttgttacag gtatcgcaat taagagtcga gaactcctcg 3240 

ctgttaaggc gtcttgctga tgttaaccag aagtacaatg atgctgctgt tgacaataga 3300 

gtgctaaaag cagatgttga gaccttgaga gcaaaggtat gctatatatg ccttttgcaa 3360 

tatgcatccc atggattgct actttggctt gtttcaaact ttcaacgtga cttgtgtacc 3420 

ctgttattag aagaataatc ccgcctacca ttatactcta taaatcacca tttggccagt 3480 

ccaaacatga ttattaaatc aggtcaatct gaacattgaa atgtatcaaa aattcgcagg 3540 

tgaagatggc agaggactcg gtgaagcggg tgacaggcat gaacgcgttg tttcccgccg 3600 

cttctgatat gtcatccctc agcatgccat tcaacagctc cccatctgaa gcaacgtcag 3660 

acgctgctgt tcccatccaa gatgacccga acaattactt cgctactaac aacgacatcg 3720 

gaggtaacaa caactacatg cccgacatac cttcttcggc tcaggaggac gaggacttcg 3780 

tcaatggcgc tctggctgcc ggcaagattg gccggccagc ctcgctgcag cgggtggcga 3840 

gcctggagca tctccagaag aggatgtgcg gtgggccggc ttcgtctggg tcgacgtcct 3900 

ga 3902 
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This international report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
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because they relate to subject matter not required to be searched by this Authority, namely: 



j | Claim Nos.: 

because they relate to parts of the international application that do not comply with the prescribed requirements to 
such an extent that no meaningful international search can be carried out, specifically: 
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Claim Nos.; 25,37-45,49-52 and 77-79 

because they are dependent claims and are not drafted in accordance with the second and ihkd sentences of Rule 



Box II Observations where unity of invention is lacking (Continuation of Item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Continuation Sheet 



As all required additional search fees were timely paid by the applicant, this international search report covers all 
searchable claims. 

2< [ I As ^1 s ea r c bable claims could be searched without effort justifying an additional fee, this Authority did not invite 
payment of any additional fee. 

3 * {XI As only some of the required additional search fees were timely paid by the applicant, this international search 
report covers only those claims for which fees were paid, specifically claims Nos.: 1,3-6,8-23,26,28- 
36,47,48,53,56-69, 1 18 and 119 



4- 1 1 No required additional search fees were timely paid by the applicant. Consequently, this international search report 
is restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

Remark on Protest [ 1 The additional search fees were accompanied by the applicant's protest. 

No protest accompanied the payment of additional search fees. 
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BOX IL OBSERVATIONS WHERE UNITY OF INVENTION IS LACKING 

This application contains the following inventions or groups of inventions, Vvhich are not so linked as to form a single general 
inventive concept under PCT Rule 13. 1. In order for all inventions to be examined, the appropriate additional examination fees must 
be paid. 

Group I, claim(s) 1 , 3-6, 8-23, 26, 28-36, 47, 48 and 118-119, drawn to a method of producing a heterologous polypeptide in a grain 
of a plant comprising a chimeric gene comprising a rice glutelin Gtl promoter, said chimeric gene, an expression vector comprising 
said chimeric gene and a transformed host and its parts comprising said chimeric gene. 

Group II, claim(s) 2-22, 24, 27-36, 46, 48, 118 and 1 19, drawn to a method of producing a heterologous polypeptide in a grain of a 
plant comprising a chimeric gene comprising a rice glutelin Gib promoter, said chimeric gene, an expression vector comprising said 
chimeric gene and a transformed host and its parts comprising said chimeric gene. 

Group III, claim(s) 54 and 55, drawn to a codon optimised nucleic acid molecule, 

Group IV, claim(s) 56-69, drawn to a method of producing a heterologous polypeptide in a grain of a plant comprising a chimeric 
gene comprising a promoter of a storage protein gene. 

Group V, claim(s) 70-72, drawn to an expression vector comprising an aleurone-specific promoter. 

Group VI, claim(s) 73-76 drawn to a transformed host comprising a chimeric gene comprising an aleurone-specific promoter, 

Group VII, claim(s) 80-89, drawn to a method of processing a transgenic seed that comprises a heterologous polypeptide. 

Group VIII, claim(s) 90-92 and 95-97, drawn to a method of processing transgenic grains containing a heterologous polypeptide to 
produce a malt syrup. 

Group IX, claim(s) 93-97, drawn to a method of processing transgenic grains containing a heterologous polypeptide to produce a malt 
extract. 

Group X, claim(s) 98-110, drawn to a transgenic malt syrup or malt extract, various products comprising said syrup or extract and 
various products comprising a protein-containing product. 

Group XI, claim(s) 111-117, drawn to a method for producing a selected protein other than a monocot-plant seed-storage protein. 



The inventions listed as Groups I-XI do not relate to a single general inventive concept under PCT Rule 13.1 because, under PCT 
Rule 13.2, they lack the same or corresponding special technical features for the following reasons: 

The broadest general concept of the instant invention can be found in Group IV, claims 56-69, as a method of producing a 
heterologous polypeptide in a grain of a plant comprising a chimeric gene comprising a promoter of a storage protein, a nucleic acid 
leader encoding a signal peptide of a storage protein and a heterologous nucleic acid that encodes the heterologous polypeptide. This 
general concept was known in the art prior to applicant' s invention and thus does not make a contribution over the prior art as 
required under PCT Rule 13.2. See, for example, Vanderkerckhove et al (U.S. Patent 5,487,991, published 30 January 1996) 
especially the claims. Hence, the instant claims as a whole do not relate to one invention or group of inventions so linked as to form 
a single general inventive concept as required under PCT Rule 13. 1, nor do all of the claimed inventions contribute over the prior art 
as required under PCT Rule 13.2, 

In addition, the chimeric gene and the expression vector of Group I cannot be used in the method of Group II, and the optimized 
nucleic acid molecule. of Group III, a third product, is not required by the methods of Group I or Group II. The invention of Group 
IV, a third method requires promotes not requied by any other group. Group V, drawn to a fourth product, requires a promoter not 
required by any other group. Group VI, a fifth product, requires microbial hosts not required by any other group. Groups VH-IX, 
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drawn to fourth through sixth methods, require particular processing steps and reagents not required by each other or any other group. 
Group X, a sixth product, requires chemical components distinct from any other group, Group XI, a seventh method, requires 
monocot transformation and protein extraction methods not required by the other groups. 
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